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3CC—..737GACCTAG7GTT3CCGGGCA 
_ .^GGCwCCTTiAGCG 

-ACA^ - 5GC5C G'*'Ci, AGi» *>jGCC3TCAAGCACi. - wCACA7CCACAC7CCGC7GCT73ACAgT.*:zn. 
■ 1 -GAA AGGA..oT. — . rtAGAC-AACCT~AAA . . . . rtCJ.CAAACCT7.CA . . .AG7TACATTCT AA7 

- . . sssaatttscaatgasc— ;aa • ■gg^tagttactcaatacatgccaaatg^atcat 

TAAA7G AAC7 G CT AC A7AG GAAAACTCAA7ATC CTSATCTTS^ , ■ IjCCCATT GA GA7T7 7; - VTC 
- •3C£raAAATT3CCCrT53TCTAAATTACCrr3^ 

■Z . -7^GACTCAGAATATC77A777<^CAATC^r77=A7ST7AAGATT3CAaATr^ 

CAAAGT3GCGCATGA-G7C7r7C77ACAG7CACGAAG7A5CA^ 

A7TA7CT^7ATCCCACCTGAAAACTATSAACCT3^^ 

- A7A7ATAGC7ATGCA*j* *rt7CACA7GGGAAUIwV wV7CCAGAAAACAGCL* 1 - -3AAGATGTCA 
CCAATCU . :TGCA3ATAATrrA7AGT3™TCACAAGGACATC3ACCT3TTATT^7CAA.GAAAGT 
77^CCATATGATATACC7CACCGAGCACGTATCATCTCTC7AArAGAAAGTG3A7GGGCACAAAA 
7C7AGATGAAAGACCATCT777»77AAAA7GTr^ 

AAG ACA7AA CT7 V »L 1 - ^AAGC 1 - - _ A777AGC7AAAGAAAACAAAU IVACAGAw . ^ : ; CAAGT 

CACCTATw - uACAAGAAGAAAATGGAA * _'.^7u Cv^ACATACC'J 1/.. AAA7CA7GGTC7 

. - C7 7AGCT7 CA.GAAAA7AG7Gsj».^.Cw-^AAA^.-.7AAGG7 

— CAAGACAA7GA- A7CTAGAAAAGCT7AAGAC : w *. 1A7TTTATGAAC , 

^^CATCAC737CCr^GAAATCACAG 7T~ 3aA7AGCACCAI "CTLiGATCTCAAASGSCTSgATT 

CAGAACG7C75CAGCCTG07A7AGCC— \GCAG7GGATCCAGAGCAAAAGG^AAGACA7^ 
CAAAT3ACAaAAG CCr GCC77AACCAC7C5CTAGATGCC L I C 7 5 > L CAGGGAC7~GA7CATGAA 
AGAGC«C7ATGAACTTGT7AG7ACCAAGCCTACAAGGACC7CAAAAGTCAflACAA77ACTAGACA 
CTACT^CATCCAAGSAGAAGAATTT^CAAAGTTATAGTA 

A7GGGTC77CAGCCrrACCCGGAAA7ACr73TGG7 

AAATAAAAGCATGTAAGT3AC73 



77AGATCACCATC 
r=AASAAGAAA7G7CT 



ATTTACT7CA 
— CATAAAAGGA7AT77ATAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA CSEQ IS 80:1) 



(57) Abstract: Novel CARD-3, CARD-4L, CARD-4S, CARD-4Y, CARD-4Z, CARD-5, and CARD-6 polypeptides, proteins, 
and nucleic acid molecules are disclosed. In addition to isolated CARD-3, CARD^*L, CARD-4S, CARD-4Y, CARD-4Z, 
CARD-5, and CARD-6 proteins, and the invention further provides CARD-3, CARD-4L, CARD-4S, CARD-4Y, CARD-4Z, 
CARD-5, and CARD-6 fusion proteins, antigenic peptides and anti-CARDS-3, anti-CARD-4L and anti-CARD-4S, anti-CARD-4Y, 
anti-CARD-4Z, anti-CARDS-5, and anti-CARD-6 antibodies. The invention also provides CARD-3, CARD-4L, CARD-4S, 
CARD-4Y, CARD-4Z. CARD-5, and CARD-6 nucleic acid molecules, recombinant expression vectors containing a nucleic acid 
molecule of the invention, host cells into which the expression vectors have been introduced and non-human transgenic animals in 
which a CARD-3, CARD-4L, CARD-4S, CARD-4Y, CARD-4Z, CARD-5, and CARD-6 gene has been introduced or disrupted. 
Diagnostic, screening and therapeutic methods utilizing compositions of the invention are also provided. 
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NOVEL MOLECULES OF THE CARD-RELATED 
PROTEIN FAMILY AND USES THEREOF 

Background of the Invention 
5 In multicellular organisms, homeostasis is maintained by balancing 

the rate of cell proliferation against the rate of cell death. Cell proliferation is 
influenced by numerous growth factors and the expression of proto-oncogenes, 
which typically encourage progression through the cell cycle. In contrast, 
numerous events, including the expression of tumor suppressor genes, can lead to 

1 0 an arrest of cellular proliferation. 

In differentiated cells, a particular type of cell death called apoptosis 
occurs when an internal suicide program is activated. This program can be 
initiated by a variety of external signals as well as signals that are generated 
within the cell in response to, for example, genetic damage. For many years, the 

15 magnitude of apoptotic cell death was not appreciated because the dying cells are 
quickly eliminated by phagocytes, without an inflammatory response. 

The mechanisms that mediate apoptosis have been intensively studied. 
These mechanisms involve the activation of endogenous proteases, loss of 
mitochondrial function, and structural changes such as disruption of the 

2 0 cytoskeleton. cell shrinkage, membrane blebbing. and nuclear condensation due 
to degradation of DNA. The various signals that trigger apoptosis are thought to 
bring about these events by converging on a common cell death pathway that is 
regulated by the expression of genes that are highly conserved from worms, such 
as C. elegans. to humans. In fact, invertebrate model systems have been 

2 5 invaluable tools in identifying and characterizing the genes that control apoptosis. 

Through the study of invertebrates and more evolved animals, numerous genes 
that are associated with cell death have been identified, but the way in which their 
products interact to execute the apoptotic program is poorly understood. 

Caspases. a class of proteins central to the apoptotic program, are 

3 0 responsible for the degradation of cellular proteins that leads to the 

morphological changes seen in cells undergoing apoptosis. Caspases are cysteine 
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proteases having specificity for aspartate at the substrate cleavage site. An 
effector caspase is activated by an initiator caspase which cleaves the effector 
caspase at specific internal aspartate residues resulting in the separation of the 
large and small subunits of the effector caspase. For example, one of the 
5 caspases identified in humans was previously known as the interleukin-la (IL- 
la) converting enzyme (ICE), a cysteine protease responsible for the processing 
of pro-IL-la to the active cytokine. Overexpression of ICE in Rat-1 fibroblasts 
induces apoptosis (Miura et al., Cell 75:653. 1993). 

Many caspases and proteins that interact with caspases possess 

1 0 domains of about 60 amino acids called a caspase recruitment domain (CARD). 
Hofmann et al. (TIBS 22:155, 1997) and others have postulated that certain 
apoptotic proteins bind to each other via their CARDs and that different subtypes 
of CARDs may confer binding specificity, regulating the activity of various 
caspases. for example. The functional significance of CARDs have been 

15 repeatedly demonstrated. For example. Duan et al. (Nature 385:86, 1997) 
showed that deleting the CARD at the N-terminus of RAIDD abolished the 
ability of RAIDD to bind to caspases. 

Caspase-9 activation may precede the activation of all other cell 
death-related caspases in the mitochondrial pathways of apoptosis (See et al.. J. 

20 Cell Biol. 144:281-292.1999). Inactive procaspase-9 is activated by interaction 
with a complex which includes Apaf-1, a CARD-containing protein, and other 
factors (Li et al., Cell 91 :479, 1997; Srinivasula et al., Mol. Cell 1 :949-959, 
1 998). Recognition of procaspase-9 by Apaf-1 occurs primarily through the 
interaction of the CARD of Apaf- 1 with the prodomain of caspase-9. The CARD 

2 5 of Apaf-1 shares about 20% sequence identity with the prodomain of procaspase- 
9. The prodomain of caspase-9 is a member of the CARD family of apoptotic 
signalling motifs (Hofmann and Bucher. Trends in Biochem. Sci. 22:155-156. 
1997). A similar domain is present in caspase activating proteins CED-4 and 
RAIDD/CRADD as well as in initiator caspases CED-3 and caspase-2/ICH-l 

30 (Duan and Dixit. Nature 385:86-89. 1997; Ahmad et al.. Cancer Res. 57:615-619. 

. 2 - 
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1997; AInemri et al., Cell 87:171, 1996). Apaf-1 can bind several other caspases, 
e.g., caspase-4 and caspase-8 (Inohara et al., J. Biol. Chem. 273:12296-12300, 
1998). 

Nuclear factor-KB (NF-tcB) is a transcription factor expressed in many 
5 cell types and which activates homologous or heterologous genes that have kB 
sites in their promoters. Molecules that regulate NF-kB activation play a critical 
role in both apoptosis and inflammation. Quiescent NF-kB resides in the 
cytoplasm as a heterodimer of proteins referred to as p50 and p65 and is 
complexed with the regulatory protein IkB. NF-kB binding to IkB causes NF-kB 

10 to remain in the cytoplasm. At least two dozen stimuli that activate NF-kB are 
known (New England Journal of Medicine 336:1066. 1997) and they include 
cytokines, protein kinase C activators, oxidants, viruses, and immune system 
stimuli. NF-kB activating stimuli activate specific IkB kinases that 
phosphorylate IkB leading to its degradation. Once liberated from IkB. NF-kB 

15 translocates to the nucleus and activates genes with kB sites in their promoters. 
The proinflammatory' cytokines TNF-a and IL-1 induce NF-kB activation by 
binding their cell-surface receptors and activating the NF-KB-inducing kinase, 
NIK. and NF-kB. NIK phosphorylates the IkB kinases a and (3 which 
phosphorylate IkB, leading to its degradation. 

2 0 NF-kB and the NF-kB pathway has been implicated in mediating 

chronic inflammation in inflammatory diseases such as asthma, ulcerative colitis, 
rheumatoid arthritis (Epstein, New England Journal of Medicine 336:1066. 1997) 
and inhibiting NF-kB or NF-kB pathways may be an effective way of treating 
these diseases. NF-kB and the NF-kB pathway has also been implicated in 

2 5 atherosclerosis (Navab et al.. American Journal of Cardiology 76:1 8C. 1995). 

especially in mediating fatty streak formation, and inhibiting NF-kB or NF-kB 
pathways may be an effective therapy for atherosclerosis. Among the genes 
activated by NF-kB are clAP-1. clAP-2. TRAF1. and TRAF2. all of which have 
been shown to protect cells from TNF-a induced cell death (Wang et al.. Science 

3 0 28 1 : 1680-83. 1998). CLAP, a protein which includes a CARD, activates the 
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Apaf-l-caspase-9 pathway and activates NF-kB by acting upstream of NIK and 
IkB kinase (Srinivasula et al., supra). 

Bcl-2 family proteins are important regulators of pathways involved in 
apoptosis and can act to inhibit or promote cell death. Expression of certain anti- 
5 apoptotic Bcl-2 family members is commonly altered in cancerous cells, 
suppressing programmed cell death and extending tumor growth. Among the 
anti-apoptotic Bcl-2 family members thus far identified are Boo, Bcl-2. Bcl-x L , 
Bcl-w, NR-13, Al. and Mcl-2. Pro-apoptotic Bcl-2 family members include Bax, 
Bak, Bad, Bik, Bid. Hrk, Bim, and Bok/Mtd. Significantly, the anti-apoptotic 

1 0 Bcl-2 family member. Bc1-xl, has been shown to interact with Apaf-1 and block 
Apaf-1 -dependent caspase-9 activation (Hu et al., Proc. Nat'l. Acad. Sci. 
95:4386-4391. 1998). Boo. another anti-apoptotic Bcl-2 family member, 
interacts with Apaf-1 and caspase-9. Bak and Bik, pro-apoptotic Bcl-2 family 
members, can disrupt the association of Boo with Apaf-1 (Song et al.. EMBO J. 

15 18:167-178, 1999). Boo is thought to be involved in the control of ovarian 
atresia and sperm maturation. Diva, another member of the Bcl-2 family, 
inhibits binding of Bcl-x L to Apf-1. preventing Bc1-xl from binding to Apaf-1. 

Neurotrophins (e.g.. NGF). which are best known as neuronal survival 
factors, can mediate apoptosis via the p75 neurotrophin receptor (p75 NTR ). It is 

2 0 thought that p75 NTR activation can lead to NF-kB activation (Carter et al.. Science 
272:542-545, 1 996). It has been proposed that p75 NTR -mediated cell death acts to 
ensure rapid cell death when a neuron is unable to obtain sufficient neurotropins. 
This mechanism could, for example, cause the elimination of neurons that reach 
an inappropriate target or that reach an appropriate target at an inappropriate time 

2 5 (Miller and Kaplan. Cell Death and Diff. 5:343-345. 1998). 
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Summary of the Invention 

The present invention is based, at least in part, on the discovery of 
genes encoding CARD-3, CARD-4, CARD-5, and CARD-6. A full-length 
human CARD-3 cDNA is presented. Several CARD-4 cDNAs are presented. 
5 Briefly, the CARD-4 gene can express a long transcript that encodes CARD-4L, 
a short transcript that encodes partial CARD-4S, or two CARD-4 splice variants 
(CARD-4Y and CARD-4Z). A full length cDNA sequence for the murine 
ortholog of CARD-4L is also presented. Full-length cDNAs encoding murine 
and human CARD-5 are presented. In addition, full-length cDNAs encoding 
1 0 human and rat CARD-6 are presented. 

CARD-3. CARD-4, CARD-5. and CARD-6 are intracellular proteins 
that are predicted to be involved in regulating caspase activation. CARD-4 is 
found to activate the NF-kB pathway and to enhance caspase 9-mediated cell 
death. In addition, proteins that bind to CARD-4 are presented including CARD- 
15 3andhNUDC. 

The CARD-3 cDNA described below (SEQ ID NO:l) has a 1620 
open reading frame (nucleotides 214 to 1833 of SEQ ID NO:l; SEQ ID NO:3) 
which encodes a 540 amino acid protein (SEQ ID NO:2). CARD-3 contains a 
kinase domain which extends from amino acid 1 to amino acid 300 of SEQ ID 
2 0 NO:2; SEQ ID NO:4. followed by a linker domain at amino acid 301 to amino 
acid 43 1 of SEQ ID NO:2; SEQ ID NO:5 and a CARD at amino acid 432 to 
amino acid 540 of SEQ ID NO:2; SEQ ID NO:6. 

At least four forms of CARD-4 exist in the cell, a long form, CARD- 
4L, a short form. CARD-4S. and two splice variants, CARD-4Y and CARD-4Z. 

2 5 The cDNA of CARD-4L described below (SEQ ID NO:7) has a 2859 nucleotide 

open reading frame (nucleotides 245-3103 of SEQ ID NO:7; SEQ ID NO:9) 
which encodes a 953 amino acid protein (SEQ ID NO:8). CARD-4L protein 
possesses a CARD domain (amino acids 15-114; SEQ ID NO: 10). The 
nucleotide sequence of the full length cDNA corresponding to the murine 

3 0 ortholog of human CARD-4L is presented (SEQ ID NO:42) as is the predicted 

- 5 - 
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amino acid sequence of murine CARD-4L (SEQ ID NO:43). A comparison 
between the predicted amino acid sequences of human CARD-4L and murine 
CARD-4L is also depicted in Figure 17. 

Human CARD-4L is also predicted to have a nucleotide binding 
5 domain which extends from about amino acid 198 to about amino acid 397 of 
SEQ ID NO:8; SEQ ID NO:l 1, a Walker Box "A", which extends from about 
amino acid 202 to about amino acid 209 of SEQ ID NO:8; SEQ ID NO: 12, a 
Walker Box "B", which extends from about amino acid 280 to about amino acid 
284, of SEQ ID NO:8; SEQ ID NO: 13, a kinase la (P-loop) subdomain, which 

10 extends from about amino acid 127 to about amino acid 212 of SEQ ID NO:8; 
SEQ ID NO:46, a kinase 2 subdomain. which extends from about amino acid 273 
to about amino acid 288 of SEQ ID NO:8; SEQ ID NO:47, a kinase 3a 
subdomain. which extends from about amino acid 327 to about amino acid 338 of 
SEQ ID NO:8; SEQ ID NO: 14, and ten Leucine-rich repeats which extend from 

15 about amino acid 674 to about amino acid 950 of SEQ ID NO: 8. The first 

Leucine-rich repeat extends from about amino acid 674 to about amino acid 701 
of SEQ ID NO:8; SEQ ID NO: 15. The second Leucine-rich repeat extends from 
about amino acid 702 to about amino acid 727 of SEQ ID NO:8; SEQ ID NO: 16. 
The third Leucine-rich repeat extends from about amino acid 728 to about amino 

2 0 acid 754 of SEQ ID NO: 8; SEQ ID NO: 17. The fourth Leucine-rich repeat 
extends from about amino acid 755 to about amino acid 782 of SEQ ID NO:8; 
SEQ ID NO:l 8. The fifth Leucine-rich repeat extends from about amino acid 
783 to about amino acid 810 of SEQ ID NO: 8; SEQ ID NO: 19. The sixth 
Leucine-rich repeat extends from about amino acid 81 1 to about amino acid 838 

25 of SEQ ID NO:8: SEQ ID NO:20. The seventh Leucine-rich repeat extends from 
about amino acid 839 to about amino acid 866 of SEQ ID NO:8; SEQ ID NO:21. 
The eighth Leucine-rich repeat extends from about amino acid 867 to about 
amino acid 894 of SEQ ID NO:8: SEQ ID NO:22. The ninth Leucine-rich repeat 
extends from about amino acid 895 to about amino acid 922 of SEQ ID NO:8: 

30 SEQ ID NO:23 and the tenth leucine-rich repeat extends from about amino acid 
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923 to about amino acid 950 of SEQ ID NO:8; SEQ ID NO:24. 

The partial cDNA of CARD-4S described below (SEQ ID NO:25) has 
a 1470 nucleotide open reading frame (nucleotides 1-1470 of SEQ ID NO:25; 
SEQ ID NO:27) which encodes a 490 amino acid protein (SEQ ID NO:26). 
5 CARD-4S protein possesses a CARD domain (amino acids 1 -74 of SEQ ID 
NO:26; SEQ ID NO:28). CARD-4S is predicted to have a P-Loop which extends 
from about amino acid 163 to about amino acid 170 of SEQ ID NO:26; SEQ ID 
NO:29. and a Walker Box "B" which extends form about amino acid 241 to about 
amino acid 245 of SEQ ID NO:26; SEQ ID NO:30. 

10 A human CARD-4Y nucleotide cDNA sequence is presented (SEQ ID 

NO:38) as is the amino acid sequence of the predicted CARD-4Y product (SEQ 
ID NO:39). A human CARD-4Z nucleotide cDNA sequence is presented (SEQ 
ID NO:40) as is the amino acid sequence of the predicted CARD-4Z product 
(SEQ ID NO:4 1 ). A comparison of the CARD-4Y, CARD-4Z. and human 

1 5 CARD-4L predicted amino acid sequences is also shown in Figure 14. 

The 761 nucleotide murine CARD-5 cDNA described below (SEQ ID 
NO:60) has a 579 nucleotide open reading frame (nucleotides 89 to 668 of SEQ 
ID NO:60; SEQ ID NO:62) which encodes a 193 amino acid protein (SEQ ID 
NO:61 ). Murine CARD-5 contains a CARD domain which extends from amino 

2 0 acid 1 1 0 to amino acid 1 79 of SEQ ID NO:6 1 (SEQ ID NO:66). 

The 740 nucleotide human CARD-5 cDNA described below (SEQ ID 
NO:48) has a 585 nucleotide open reading frame (nucleotides 54 to 639 of SEQ 
ID NO:48; SEQ ID NO:50) which encodes a 195 amino acid protein (SEQ ID 
NO:49). Human CARD-5 contains a CARD domain which extends from amino 

2 5 acid 1 1 1 to amino acid 1 8 1 of SEQ ID NO:49 (SEQ ID NO:58). 

The 5252 nucleotide rat CARD-6 cDNA described below (SEQ ID 
NO:51) has a 2715 nucleotide open reading frame (nucleotides 169 to 2883 of 
SEQ ID NO:51: SEQ ID NO:53) which encodes a 905 amino acid protein (SEQ 
ID NO:52). Rat CARD-6 contains a CARD domain which extends from amino 

3 0 acid 1 to amino acid 108 of SEQ ID NO: 52 (SEQ ID NO: 59). Rat CARD-6 also 

-7- 
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has a proline-rich C-terminus which extends from amino acid 698 to amino acid 
905 of SEQ ID NO:52 (SEQ ID NO:65). This proline-rich domain includes five 
putative SH3 binding sites. These binding sites have the sequence PXXP and are 
located at amino acids 710 to 713 (PAHP), 806 to 809 (PLRP). 819 to 822 
5 (PIPP), 857 to 860 (PPHP), and 88 1 to 884 (PSQP) of SEQ ID NO:52. 

The 4244 human CARD-6 cDNA described below (SEQ ID NO:54) 
has a 3 1 1 1 nucleotide open reading frame (nucleotides 200 to 33 1 0 of SEQ ID 
NO: 54; SEQ ID NO:56) which encodes a 1037 amino acid protein (SEQ ID 
NO: 55). Human CARD-6 includes a CARD domain which extends from amino 

10 acid 5 to amino acid 92 of SEQ ID NO:55 (SEQ ID NO:64). 

Like other proteins containing a CARD domain, CARD-3, CARD-4, 
CARD-5, and CARD-6 to participate in the network of interactions that lead to 
caspase activity. Human CARD-4L likely plays a functional role in caspase 
activation similar to that of Apaf-1 (Zou et al. (1997) Cell 90:405-413). For 

15 example, upon activation, CARD-4L binds a nucleotide, thus allowing CARD-4L 
to bind and activate a CARD-containing caspase via a CARD-CARD interaction, 
leading to apoptotic death of the cell. CARD-3. CARD-4, CARD-5, and CARD- 
6 molecules are useful as modulating agents in regulating a variety of cellular 
processes including cell growth and cell death. In one aspect, this invention 

2 0 provides isolated nucleic acid molecules encoding CARD-3. CARD-4, CARD-5. 
or CARD-6 proteins or biologically active portions thereof, as well as nucleic 
acid fragments suitable as primers or hybridization probes for the detection of 
CARD-3, CARD-4, CARD-5. or CARD-6 encoding nucleic acids. 

The invention encompasses methods of diagnosing and treating 

2 5 patients who are suffering from a disorder associated with an abnormal level or 
rate (undesirably high or undesirably low) of apoptotic cell death, abnormal 
activity of the Fas/APO-1 receptor complex, abnormal activity of the TNF 
receptor complex, or abnormal activity of a caspase by administering a 
compound that modulates the expression of CARD-3. CARD-4, CARD-5. or 

3 0 CARD-6 (at the DNA. mRNA or protein level, e.g., by altering mRNA splicing) 
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or by altering the activity of CARD-3, CARD-4, CARD-5, or CARD-6. 
Examples of such compounds include small molecules, antisense nucleic acid 
molecules, ribozymes, and polypeptides. 

Certain disorders are associated with an increased number of 
5 surviving cells, which are produced and continue to survive or proliferate when 
apoptosis is inhibited or occurs at an undesirably low rate. Compounds that 
modulate the expression or activity of CARD-3, CARD-4, CARD-5, or CARD-6 
can be used to treat or diagnose such disorders. These disorders include cancer 
(particularly follicular lymphomas, chronic myelogenous leukemia, melanoma, 

10 colon cancer, lung carcinoma, carcinomas associated with mutations in p53. and 
hormone-dependent tumors such as breast cancer, prostate cancer, and ovarian 
cancer). Such compounds can also be used to treat viral infections (such as those 
caused by herpesviruses, poxviruses, and adenoviruses). Failure to remove 
autoimmune cells that arise during development or that develop as a result of 

15 somatic mutation during an immune response can result in autoimmune disease. 
Thus, autoimmune disorders can be caused by an undesirably low levels of 
apoptosis. Accordingly, modulators of CARD-3, CARD-4. CARD-5, or CARD- 
6 activity or expression can be used to treat autoimmune disorders (e.g., systemic 
lupus erythematosis, immune-mediated glomerulonephritis, and arthritis). 

2 0 Many diseases are associated with an undesirably high rate of 

apoptosis. Modulators of CARD-3, CARD04. CARD-5, or CARD-6 expression 
or activity can be used to treat or diagnose such disorders. For example, 
populations of cells are often depleted in the event of viral infection, with perhaps 
the most dramatic example being the cell depletion caused by the human 

2 5 immunodeficiency virus (HIV). Surprisingly, most T cells that die during HIV 

infections do not appear to be infected with HIV. Although a number of 
explanations have been proposed, recent evidence suggests that stimulation of the 
CD4 receptor results in the enhanced susceptibility of uninfected T cells to 
undergo apoptosis. A wide variety of neurological diseases are characterized by 

3 0 the gradual loss of specific sets of neurons. Such disorders include Alzheimer's 

-9- 

BNSDOCID: <WO 0100B26A2_L> 



WO 01/00826 



PCT/USOO/17691 



disease. Parkinson's disease, amyotrophic lateral sclerosis (ALS) retinitis 
pigmentosa, spinal muscular atrophy, and various forms of cerebellar 
degeneration. The cell loss in these diseases does not induce an inflammatory 
response, and apoptosis appears to be the mechanism of cell death. In addition, a 
5 number of hematologic diseases are associated with a decreased production of 
blood cells. These disorders include anemia associated with chronic disease, 
aplastic anemia, chronic neutropenia, and the myelodysplastic syndromes. 
Disorders of blood cell production, such as myelodysplastic syndrome and some 
forms of aplastic anemia, are associated with increased apoptotic cell death 

1 0 within the bone marrow. These disorders could result from the activation of 
genes that promote apoptosis, acquired deficiencies in stromal cells or 
hematopoietic survival factors, or the direct effects of toxins and mediators of 
immune responses. Two common disorders associated with cell death are 
myocardial infarctions and stroke. In both disorders, cells within the central area 

15 of ischemia, which is produced in the event of acute loss of blood flow, appear to 
die rapidly as a result of necrosis. However, outside the central ischemic zone, 
cells die over a more protracted time period and morphologically appear to die by 
apoptosis. 

Proteins containing a CARD domain are thought to be involved in 
2 0 various inflammatory disorders. Accordingly, CARD-3. CARD-4. CARD-5. and 
CARD-6 polypeptides, nucleic acids and modulators of CARD-3. CARD-4. 
CARD-5, or CARD-6 expression or activity can be used to treat immune 
disorders. Such immune disorders include, but are not limited to, chronic 
inflammatory diseases and disorders, such as Crohn's disease, reactive arthritis, 

2 5 including Lyme disease, insulin-dependent diabetes, organ-specific 

autoimmunity, including multiple sclerosis, Hashimoto's thyroiditis and Grave's 
disease, contact dermatitis, psoriasis, graft rejection, graft versus host disease, 
sarcoidosis, atopic conditions, such as asthma and allergy, including allergic 
rhinitis, gastrointestinal allergies, including food allergies, eosinophilia. 

3 0 conjunctivitis, glomerular nephritis, certain pathogen susceptibilities such as 
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helminthic (e.g., leishmaniasis), certain viral infections, including HIV, and 
bacterial infections, including tuberculosis and lepromatous leprosy. 

In addition to the aforementioned disorders, CARD-3, CARD-4, 
CARD-5, and CARD-6 polypeptides, nucleic acids, and modulators of CARD-3, 
5 CARD-4, CARD-5 or CARD-6 expression or activity can be used to treat 

disorders of cell signaling and disorders of tissues in which CARD-3. CARD-4, 
CARD-5 or CARD-6 is expressed. 

The invention features a nucleic acid molecule which is at least 45% 
(or 55%, 65%, 75%, 85%, 95%, or 98%) identical to the nucleotide sequence 

10 shown in SEQ ID NO:l. SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:9. SEQ 

ID:25. SEQ ID NO:27. SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
NO:48. SEQ ID NO:50. SEQ ID NO:5 1 . SEQ ID NO:53, SEQ ID NO:54, SEQ 
ID NO:56, SEQ ID NO:60. SEQ ID NO:62. the nucleotide sequence of the 
cDNA insert of the plasmid deposited with the ATCC as Accession Number 

1 5 203037 (the "cDNA of ATCC 203037"). the nucleotide sequence of the cDNA 
insert of the plasmid deposited with the ATCC as Accession Number 203035 (the 
"cDNA of ATCC 203035"), the nucleotide sequence of the cDNA insert of the 
plasmid deposited with the ATCC as Accession Number 203036 (the "cDNA of 
ATCC 203036"), the nucleotide sequence of the cDNA insert of the plasmid 

2 0 deposited with the ATCC as Accession Number PTA-2 1 1 (the "cDNA of ATCC 
PTA-21 1 "), the nucleotide sequence of the cDNA insert of the plasmid deposited 
with the ATCC as Accession Number PTA-2 12 ("the cDNA of ATCC PTA- 
21 2"). the nucleotide sequence of the cDNA insert of the plasmid deposited with 
the ATCC as Accession Number PTA-2 13 (the "cDNA of ATCC PTA-2 13"), or 

2 5 a complement thereof. 

The invention features a nucleic acid molecule which includes a 
fragment of at least 150 (300. 325. 350, 375. 400. 425. 450. 500. 550. 600. 650. 
700. 800. 900. 1000. 1300. 1600 or 1931) nucleotides of the nucleotide sequence 
shown in SEQ ID NO: 1 . or SEQ ID NO:3. or the nucleotide sequence of the 

3 0 cDN A ATCC 203037. or a complement thereof. 
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The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350, 400, 450, 500. 550, 600, 650, 700, 800, 900, 1000, 
1300. 1600, 1900. 2100, 2400, 2700, 3000, or 3382) nucleotides of the nucleotide 
sequence shown in SEQ ID NO:7, SEQ ID NO:9, or the nucleotide sequence of 
5 the cDNA ATCC 203035, or a complement thereof. 

Also within the invention is a nucleic acid molecule which includes a 
fragment of at least 150 (350. 400. 450. 500. 550. 600. 650. 700, 800. 900, 1000. 
1300. 1600. 1900. 2100, 2400, 2700, and 3080) nucleotides of the nucleotide 
sequence shown in SEQ ID NO:25, SEQ ID NO:27. SEQ ID NO:38. SEQ ID 
10 NO:40. or the nucleotide sequence of the cDNA of ATCC 203036. or a 
complement thereof. 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350, 400, 450, 500, 550. 600, 650, 700, and 761) 
nucleotides of the nucleotide sequence shown in SEQ ID NO:60, SEQ ID NO:62, 
15 or the nucleotide sequence of the cDNA of ATCC PTA-212, or a complement 
thereof. 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350. 400. 450, 500. 550, 600, 650, 700. and 740) 
nucleotides of the nucleotide sequence shown in SEQ ID NO:48. SEQ ID NO:50. 
2 0 the cDNA of ATCC PTA-2 1 3. or a complement thereof 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350. 400, 450, 500. 600, 700, 800. 900, 1000, 1500. 
2000. 2500, 3000, 3500, 4000, 4500, 5000, and 5252) nucleotides of the 
nucleotide sequence shown in SEQ ID NO:51. SEQ ID NO:53, or a complement 

2 5 thereof. 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (200. 300, 400. 500. 600. 700, 800, 900, 1000, 1400, 
1 800. 2200. 2600. or 3000) nucleotides of the nucleotide sequence shown in SEQ 
ID NO:54. SEQ ID NO:56. the cDNA of ATCC PTA-213. or a complement 

3 0 thereof. 
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The invention features a nucleic acid molecule which includes a 
nucleotide sequence encoding a protein having an amino acid sequence that is at 
least 45% (or 55%, 65%, 75%, 85%, 95%, or 98%) identical to the amino acid 
sequence of SEQ ID NO:2. SEQ ID NO:8, SEQ ID NO:26. SEQ ID NO:39. SEQ 
5 ID NO:41, SEQ ID NO:43, SEQ ID NO:49. SEQ ID NO:52, SEQ ID NO:55, 
SEQ ID NO:61, or the amino acid sequence encoded by the cDNA of ATCC 
203037, the amino acid sequence encoded by the cDNA of ATCC 203035, the 
amino acid sequence encoded by the cDNA of ATCC 203036, the amino acid 
sequence encoded by the cDNA of ATCC PTA-21 1 . the amino acid sequence 

1 0 encoded by the cDNA of ATCC PTA-212, or the amino acid sequence encoded 
by the cDNA of ATCC PTA-21 3. 

In an embodiment, a CARD-3 nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO:l, or SEQ ID NO:3, or the nucleotide 
sequence of the cDNA of ATCC 203037. 

15 In another embodiment, a CARD-4L nucleic acid molecule has the 

nucleotide sequence shown in SEQ ID NO:7, or SEQ ID NO:9. or the nucleotide 
sequence of the cDNA of ATCC 203035. 

In yet another embodiment, a CARD-4S nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO:25. or SEQ ID NO:27, or the 

20 nucleotide sequence of the cDNA of ATCC 203036. In another embodiment, a 
murine CARD-4L nucleic acid molecule has the nucleotide sequence shown in 
SEQ ID NO:42. 

In another embodiment, a CARD-4Y nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO:38. 
25 In another embodiment- a CARD-4Z nucleic acid molecule has the 

nucleotide sequence shown in SEQ ID NO:40. 

In another embodiment, a human CARD-5 nucleic acid molecule has 
the nucleotide sequence shown in SEQ ID NO:48. SEQ ID NO:50 or the 
nucleotide sequence of the cDNA of ATCC PTA-21 3. In another embodiment, a 
3 0 murine CARD-5 nucleic acid molecule has the nucleotide sequence shown in 
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SEQ ID NO:60 or SEQ ID NO:62. 

In yet another embodiment, a rat CARD-6 nucleic acid molecule has 
the nucleotide sequence shown in SEQ ID NO:51 , SEQ ID NO: 5 3, or the 
nucleotide sequence of the cDN A of ATCC PTA-2 1 1 . 
5 In still another embodiment, a human CARD-6 nucleic acid molecule 

has the nucleotide sequence shown in SEQ ID NO:54. SEQ ID NO:56, or the 
nucleotide sequence of the cDNA of ATCC PTA-213. 

Also within the invention is a nucleic acid molecule which encodes a 
fragment of a polypeptide having the amino acid sequence of SEQ ID NO:2, SEQ 

10 ID NO:S, SEQ ID NO:26, SEQ ID NO:39, SEQ ID NO:41 , SEQ ID NO:43, SEQ 
ID NO:49, SEQ ID NO:52, SEQ ID NO:55, SEQ ID NO:61, the fragment 
including at least 15 (25, 30. 50, 100, 150, 300. 400 or 540. 600. 700, 800. 900) 
contiguous amino acids of SEQ ID NO:2. SEQ ID NO:8, SEQ ID NO:26. SEQ 
ID NO:39. SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:49, SEQ ID NO:52, or 

1 5 SEQ ID NO:55, SEQ ID NO:61 , the polypeptide encoded by the cDNA of ATCC 
Accession Number 203037, the polypeptide encoded by the cDNA of ATCC 
Accession Number 203035, the polypeptide encoded by the cDNA of ATCC 
Accession Number 203036. the polypeptide encoded by the cDNA of ATCC 
Accession Number PTA-2 1 1. the polypeptide encoded by the cDNA of ATCC 

2 0 Accession Number PTA-2 1 2. or the polypeptide encoded by the cDNA of ATCC 
Accession Number PTA-213. 

The invention includes a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:26. SEQ ID NO:39. SEQ 

2 5 ID NO:4 1 . SEQ ID NO:43, SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55, 

SEQ ID NO:61 . or an amino acid sequence encoded by the cDNA of ATCC 
Accession Number 203037. 203035. 203036. PTA-21 1. PTA-2 12, or PTA-213, 
wherein the nucleic acid molecule hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 1 . SEQ ID NO:3, SEQ ID NO:7. SEQ ID NO:9. SEQ 

3 0 ID:25. SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
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NO:48, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:53. SEQ ID NO:54. SEQ 
ID NO:56. SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 203037. the 
cDNA of ATCC 203035, the cDNA of ATCC 203036, the cDNA of ATCC PTA- 
21 1, the cDNA of ATCC PTA-212, or the cDNA of PTA-213 under stringent 
5 conditions. 

In general, an allelic variant of a gene will be readily identifiable as 
mapping to the same chromosomal location as said gene. For example, in 
Example 6, the chromosomal location of the human CARD-4 gene is discovered 
to be chromosome 7 close to the SHGC-31928 genetic marker. Allelic variants 

1 0 of human CARD-4 will be readily identifiable as mapping to the human CARD-4 
locus on chromosome 7 near genetic marker SHGC-3 1 928. 

Also within the invention are: an isolated CARD-3 protein having an 
amino acid sequence that is at least about 65%, preferably 75%. 85%. 95%, or 
98% identical to the amino acid sequence of SEQ ID NO:2; an isolated CARD-3 

15 protein having an amino acid sequence that is at least about 85%, 95%, or 98% 
identical to the kinase domain of SEQ ID NO:2 (e.g., about amino acid residues 1 
to 300 of SEQ ID NO:2; SEQ ID NO:4); and an isolated CARD-3 protein having 
an amino acid sequence that is at least about 85%, 95%. or 98% identical to the 
linker domain of SEQ ID NO:2 (e.g., about amino acid residues 301 to 43 1 of 

2 0 SEQ ID NO:2; SEQ ID NO:5); an isolated CARD-3 protein having an amino acid 
sequence that is at least about 85%, 95%, or 98% identical to the CARD domain 
of SEQ ID NO:2 (e.g.. about amino acid residues 432 to 540 of SEQ ID NO:2; 
SEQ ID NO:6); an isolated CARD-4 L protein having an amino acid sequence 
that is at least about 65%. preferably 75%. 85%. 95%. or 98% identical to the 

2 5 amino acid sequence of SEQ ID NO:8; an isolated CARD-4 L protein having an 

amino acid sequence that is at least about 85%, 95%, or 98% identical to the 
CARD domain of SEQ ID NO:8 (e.g., about amino acid residues 15 to 1 14 of 
SEQ ID NO:8; SEQ ID NO: 10); an isolated CARD-4L protein having an amino 
acid sequence that is at least about 85%. 95%. or 98% identical to the nucleotide 

3 0 binding domain of SEQ ID NO:8 (e.g.. about amino acid residues 198 to 397 of 
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SEQ ID NO:8; SEQ ID NO:l 1 ; an isolated CARD-4L protein having an amino 
acid sequence that is at least about 85%, 95%, or 98% identical to the kinase 1 a 
(P-loop) subdomain SEQ ID NO:8 (e.g.. about amino acid 127 to about amino 
acid 212 of SEQ ID NO:8; SEQ ID NO:46): an isolated CARD-4L protein having 
5 an amino acid sequence that is at least about 85%. 95%. or 98% identical to the 
kinase 2 subdomain of SEQ ID NO:8 (e.g., about amino acid 273 to about amino 
acid 288 of SEQ ID NO:8; SEQ ID NO:47); an isolated CARD-4L protein having 
an amino acid sequence that is at least about 85%, 95%. or 98% identical to a 
kinase 3a subdomain of SEQ ID NO:8 (e.g., about amino acid residues 327 to 

10 338 of SEQ ID NO:8: SEQ ID NO:14); an isolated CARD-4L protein having an 
amino acid sequence that is at least about 85%, 95%. or 98% identical to the 
Leucine-rich repeats of SEQ IDNO:8 (e.g., about amino acid residues 674 to 701 
of SEQ ID NO:8; SEQ ID NO: 15; from amino acid 702 to amino acid 727 of 
SEQ ID NO:8; SEQ ID NO: 1 6; which extends from amino acid 728 to amino 

1 5 acid 754 SEQ ID NO:8; SEQ ID NO: 1 7; from amino acid 755 to amino acid 782 
of SEQ ID NO:8; SEQ ID NO: 18; from amino acid 783 to amino acid 810 of 
SEQ ID NO:8; SEQ ID NO: 19; from amino acid 81 1 to amino acid 838 of SEQ 
ID NO:8; SEQ ID NO:20 from amino acid 839 to amino acid 866 of SEQ ID 
NO:8: SEQ ID NO:21; from amino acid 867 to amino acid 894 of SEQ ID NO:8: 

2 0 SEQ ID NO:22; from amino acid 895 to amino acid 922 of SEQ ID NO:8; SEQ 
ID NO:23; and from amino acid 923 to amino acid 950 of SEQ ID NO:8; SEQ ID 
NO:24); an isolated CARD-4S protein having an amino acid sequence that is at 
least about 65%, preferably 75%. 85%, 95%, or 98% identical to the amino acid 
sequence of SEQ ID NO:26; an isolated CARD-4S protein having an amino acid 

2 5 sequence that is at least about 85%. 95%, or 98% identical to the CARD domain 

of SEQ ID NO:26 (e.g.. about amino acid residues 1 to 74 of SEQ ID NO:26; 
SEQ ID NO:28). Also within the invention are: an isolated murine CARD-4L 
protein having an amino acid sequence that is at least about 65%, preferably 75%. 
85%. 95%, or 98% identical to the amino acid sequence of SEQ ID NO:43. Also 

3 0 within the invention are: an isolated CARD-4Y protein having an amino acid 

- 16- 

BNSOOCID: <WO 0100826A2_I_> 



WO 01/00826 



PCT/US00/17691 



sequence that is at least about 65%, preferably 75%, 85%, 95%, or 98% identical 
to the amino acid sequence of SEQ ID NO:39. Also within the invention are: an 
isolated CARD-4Z protein having an amino acid sequence that is at least about 
65%, preferably 75%, 85%, 95%, or 98% identical to the amino acid sequence of 
5 SEQIDNO:41. 

Also within the invention are: an isolated CARD-5 protein having an 
amino acid sequence that is at least about 65%, preferably 75%, 85%, 95%, or 
98% identical to the amino acid sequence of SEQ ID NO:49 and an isolated 
CARD-5 protein comprising an amino acid sequence that is at least about 90%, 
1 0 95%, or 98% identical to SEQ ID NO: 58 (CARD domain). 

Also within the invention are an isolated CARD-5 protein having an 
amino acid sequence that is at least about 65%, preferably 75%. 85%, 95%. or 
98% identical to the amino acid sequence of SEQ ID NO:60 and an isolated 
CARD-5 protein comprising an amino acid sequence that is at least about 90%, 
1 5 95%, or 98% identical to SEQ ID NO:57 (CARD domain). 

The invention also includes: an isolated CARD-6 protein having an 
amino acid sequence that is at least about 65%, preferably 75%. 85%. 95%, or 
98% identical to the amino acid sequence of SEQ ID NO:52 and an isolated 
CARD-6 protein having an amino acid sequence that is at least about 90%, 95%, 
2 0 or 98% identical to SEQ ID NO:59 (CARD domain). 

The invention also includes: an isolated CARD-6 protein having an 
amino acid sequence that is at least about 65%, preferably 75%. 85%, 95%, or 
98% identical to the amino acid sequence of SEQ ID NO:55 and an isolated 
CARD-6 protein having an amino acid sequence that is at least about 90%, 95%. 

2 5 or 98% identical to SEQ ID NO:64 (CARD domain). 

Also within the invention are: an isolated CARD-3 protein which is 
encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%. 85%, or 95% identical to SEQ ID NO:3 or the cDNA 
of ATCC 203037; an isolated CARD-3 protein which is encoded by a nucleic 

3 0 acid molecule having a nucleotide sequence at least about 65% preferably 75%, 
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85%, or 95% identical to the kinase domain encoding portion of SEQ ID NO:l 
(e.g., about nucleotides 213 to 1 1 1 3 of SEQ ID NO: 1); an isolated CARD-3 
protein which is encoded by a nucleic acid molecule having a nucleotide 
sequence at least about 65% preferably 75%, 85%, or 95% identical the linker 
5 domain encoding portion of SEQ ID NO:l (e.g., about nucleotides 1 1 14 to 1506 
of SEQ ID NO: 1); and an isolated CARD-3 protein which is encoded by a 
nucleic acid molecule having a nucleotide sequence at least about 65% preferably 
75%, 85%, or 95% identical the CARD domain encoding portion of SEQ ID 
NO:l (e.g., about nucleotides 1507 to 1833 of SEQ ID NO:l); and an isolated 

1 0 CARD-3 protein which is encoded by a nucleic acid molecule having a 

nucleotide sequence which hybridizes under stringent hybridization conditions to 
a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:3 or the 
non-coding strand of the cDNA of ATCC 203037. Also within the invention are: 
an isolated CARD-4Y protein which is encoded by a nucleic acid molecule 

15 having a nucleotide sequence that is at least about 65%, preferably 75%, 85%, or 
95% identical to SEQ ID NO:38. Also within the invention are nucleic acid 
molecules which include about nucleotides 2759 to 2842 of SEQ ID NO:7; about 
nucleotides 2843 to 2926 of SEQ ID NO:7; about nucleotides 2927 to 3010 of 
SEQ ID NO:7; about nucleotides 301 1 to 3094 of SEQ ID NO:7; and an isolated 

2 0 CARD-4L protein which is encoded by a nucleic acid molecule having a 

nucleotide sequence which hybridizes under stringent hybridization conditions to 
a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:9, or the 
non-coding strand of the cDNA of ATCC 203035. 

Also within the invention are an isolated CARD-4S protein which is 

2 5 encoded by a nucleic acid molecule having a nucleotide sequence that is at least 

about 65%. preferably 75%, 85%. or 95% identical to SEQ ID NO:27: an isolated 
CARD-3 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 65% preferably 75%. 85%. or 95% identical 
the CARD domain encoding portion of SEQ ID NO:25 (e.g.. about nucleotides 1 

3 0 to 222 of SEQ ID NO:25): an isolated CARD-3 protein which is encoded by a 
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nucleic acid molecule having a nucleotide sequence at least about 65% preferably 
75%, 85%, or 95% identical the P-Loop encoding portion of SEQ ID NO:25 
(e.g., about nucleotides 485 to 510 of SEQ ID NO:25). 

Also within the invention are an isolated CARD-5 protein which is' 
5 encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%, 85%, or 95% identical to SEQ ID NO:48 or the 
cDNA of ATCC PTA-213; an isolated CARD-5 protein which is encoded by a 
nucleic acid molecule having a nucleotide sequence at least about 90% preferably 
95%. or 98% identical to the CARD encoding portion of SEQ ID NO:48 (e.g., 

10 about nucleotides 383 to 596 of SEQ ID NO:48); and an isolated CARD-5 
protein which is encoded by a nucleic acid molecule having a nucleotide 
sequence which hybridizes under stringent hybridization conditions to a nucleic 
acid molecule having the nucleotide sequence of SEQ ID NO:48 or the non- 
coding strand of the cDNA of ATCC PTA-213. 

15 Also within the invention are an isolated CARD-5 protein which is 

encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%. 85%, or 95% identical to SEQ ID NO:60; an isolated 
CARD-5 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 90% preferably 95%, or 98% identical to the 

2 0 CARD encoding portion of SEQ ID NO:60 (e.g.. about nucleotides 416 to 625 of 
SEQ ID NO:60); and an isolated CARD-5 protein which is encoded by a nucleic 
acid molecule having a nucleotide sequence which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO:60. 

2 5 Also within the invention are an isolated CARD-6 protein which is 

encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%, 85%, or 95% identical to SEQ ID NO:5 1 ; an isolated 
CARD-6 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 90% preferably 95%. or 98% identical to the 

3 0 CARD encoding portion of SEQ ID NO:51 (e.g.. about nucleotides 169 to 456 of 
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SEQ ID NO:51); and an isolated CARD-6 protein which is encoded by a nucleic 
acid molecule having a nucleotide sequence which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO:5 1 . 
5 Also within the invention are an isolated CARD-6 protein which is 

encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%, 85%, or 95% identical to SEQ ID NO:54; an isolated 
CARD-6 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 90% preferably 95%, or 98% identical to the 

1 0 CARD encoding portion of SEQ ID NO:54; and an isolated CARD-6 protein 

which is encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:54. 

Another embodiment of the invention features CARD-3. CARD-4, 

15 CARD-5, or CARD-6 nucleic acid molecules which specifically detect CARD-3. 
CARD-4, CARD-5. or CARD-6 nucleic acid molecules, relative to nucleic acid 
molecules encoding other members of the CARD superfamily. For example, in 
one embodiment, a CARD-4L nucleic acid molecule hybridizes under stringent 
conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ 

2 0 ID NO:7, SEQ ID NO:9. or the cDNA of ATCC 203035, or a complement 

thereof. In another embodiment, the CARD-4L nucleic acid molecule is at least 
300 (350, 400, 450, 500, 550, 600. 650, 700, 800, 900, 1000, 1300. 1600, 1900, 
2100, 2400, 2700, 3000, or 3382) nucleotides in length and hybridizes under 
stringent conditions to a nucleic acid molecule comprising the nucleotide 

2 5 sequence shown in SEQ ID NO:7. SEQ ID NO:9. the cDNA of ATCC 203035, or 

a complement thereof. In another embodiment, an isolated CARD-4L nucleic 
acid molecule comprises nucleotides 287 to 586 of SEQ ID NO:7, encoding the 
CARD domain of CARD-4L, or a complement thereof. In yet another 
embodiment, the invention provides an isolated nucleic acid molecule which is 

3 0 antisense to the coding strand of a CARD-4L nucleic acid. 
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In another embodiment, a CARD-5 nucleic acid molecule hybridizes 
under stringent conditions to a nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO:48. SEQ ID NO:50, or the cDNA of ATCC PTA-213, or 
a complement thereof. In another embodiment, the CARD-5 nucleic acid 
5 molecule is at least 300 (350, 400, 450, 500. 550, 585, 600, 650, 700, or 740) 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence shown in SEQ ID NO:48, SEQ ID 
NO:50, the cDNA of ATCC PTA-213, or a complement thereof. In another 
embodiment, an isolated CARD-5 nucleic acid molecule comprises nucleotides 

10 383 to 596 of SEQ ID NO:48, encoding the CARD of CARD-5. In yet another 
embodiment, the invention provides an isolated nucleic acid molecule which is 
antisense to the coding strand of a CARD-5 nucleic acid. 

Another aspect of the invention provides a vector, e.g.. a recombinant 
expression vector, comprising a CARD-3. CARD-4, CARD-5. or CARD-6 

15 nucleic acid molecule of the invention. In another embodiment the invention 
provides a host cell containing such a vector. The invention also provides a 
method for producing CARD-3. CARD-4. CARD-5. or CARD-6 protein by 
culturing, in a suitable medium, a host cell of the invention containing a 
recombinant expression vector such that a CARD-3. CARD-4. CARD-5. or 

2 0 CARD-6 protein is produced. 

Another aspect of this invention features isolated or recombinant 
CARD-3, CARD-4. CARD-5. or CARD-6 proteins and polypeptides. Preferred 
CARD-3, CARD-4. CARD-5, or CARD-6 proteins and polypeptides possess at 
least one biological activity possessed by naturally occurring human CARD-3. 

2 5 CARD-4. CARD-5. or CARD-6, e.g., (1 ) the ability to form protein :protein 

interactions with proteins in the apoptotic signaling pathway; (2) the ability to 
form CARD-CARD interactions with proteins in the apoptotic signaling pathway; 
(3) the ability to bind a CARD-3, CARD-4, CARD-5. or CARD-6 ligand; and (4) 
the ability to bind to an intracellular target. Other activities include: ( 1 ) 

3 0 modulation of cellular proliferation: (2) modulation of cellular differentiation; (3) 
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modulation of cellular death; and (4) modulation of the NF-kB pathway. 

The CARD-3, CARD-4. CARD-5. or CARD-6 proteins of the present 
invention, or biologically active portions thereof, can be operatively linked to a 
non-CARD-3, non-CARD-4, non-CARD-5, or non-CARD-6 polypeptide (e.g., 
5 heterologous amino acid sequences) to form CARD-3, CARD-4. CARD-5, or 
CARD-6 fusion proteins, respectively. The invention further features antibodies 
that specifically bind CARD-3. CARD-4. CARD-5. or CARD-6 proteins, such as 
monoclonal or polyclonal antibodies. In addition, the CARD-3. CARD-4. 
CARD-5. or CARD-6 proteins or biologically active portions thereof can be 

10 incorporated into pharmaceutical compositions, which optionally include 
pharmaceutical ly acceptable carriers. 

In another aspect, the present invention provides a method for 
detecting the presence of CARD-3, CARD-4, CARD-5. or CARD-6 activity or 
expression in a biological sample by contacting the biological sample with an 

15 agent capable of detecting an indicator of CARD-3, CARD-4, CARD-5, or 
CARD-6 activity such that the presence of CARD-3. CARD-4, CARD-5. or 
CARD-6 activity is detected in the biological sample. 

In another aspect, the invention provides a method for modulating 
CARD-3. CARD-4, CARD-5, or CARD-6 activity comprising contacting a cell 

2 0 with an agent that modulates (inhibits or stimulates) CARD-3, CARD-4, CARD- 
5. or CARD-6 activity or expression such that CARD-3, CARD-4, CARD-5, or 
CARD-6 activity or expression in the cell is modulated. In one embodiment, the 
agent is an antibody that specifically binds to CARD-3. CARD-4. CARD-5. or 
CARD-6 protein. In another embodiment, the agent modulates expression of 

2 5 CARD-3. CARD-4. CARD-5, or CARD-6 by modulating transcription of a 

CARD-3. CARD-4, CARD-5, or CARD-6 gene, splicing of a CARD-3, CARD- 

4. CARD-5. or CARD-6 mRNA, or translation of a CARD-3, CARD-4. CARD- 

5. or CARD-6 mRNA. In yet another embodiment, the agent is a nucleic acid 
molecule having a nucleotide sequence that is antisense to the coding strand of 

3 0 the CARD-3. CARD-4. CARD-5, or CARD-6 mRNA or the CARD-3, CARD-4. 
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CARD-5, or CARD-6 gene. 

In one embodiment, the methods of the present invention are used to 
treat a subject having a disorder characterized by aberrant CARD-3, CARD-4, 
CARD-5, or CARD-6 protein or nucleic acid expression or activity or related to 
5 CARD-3, CARD-4. CARD-5, or CARD-6 expression or activity by 

administering an agent which is a CARD-3. CARD-4, CARD-5. or CARD-6 
modulator to the subject. In one embodiment, the CARD-3, CARD-4, CARD-5, 
or CARD-6 modulator is a CARD-3. CARD-4. CARD-5. or CARD-6 protein. In 
another embodiment the CARD-3. CARD-4. CARD-5. or CARD-6 modulator is 

10 a CARD-3. CARD-4. CARD-5. or CARD-6 nucleic acid molecule. In other 
embodiments, the CARD-3, CARD-4, CARD-5. or CARD-6 modulator is a 
peptide, peptidomimetic. or other small molecule. 

The present invention also provides a diagnostic assay for identifying 
the presence or absence of a genetic lesion or mutation characterized by at least 

1 5 one of: (i) aberrant modification or mutation of a gene encoding a CARD-3, 

CARD-4, CARD-5, or CARD-6 protein; (ii) mis-regulation of a gene encoding a 
CARD-3, CARD-4, CARD-5. or CARD-6 protein; (iii) aberrant RNA splicing; 
and (iv) aberrant post-translational modification of a CARD-3, CARD-4. CARD- 
5. or CARD-6 protein, wherein a wild-type form of the gene encodes a protein 

2 0 with a CARD-3. CARD-4. CARD-5. or CARD-6 activity. 

In another aspect, the invention provides a method for identifying a 
compound that binds to or modulates the activity of a CARD-3. CARD-4. 
CARD-5, or CARD-6 protein. In general, such methods entail measuring a 
biological activity of a CARD-3, CARD-4. CARD-5. or CARD-6 protein in the 

2 5 presence and absence of a test compound and identifying those compounds which 

alter the activity of the CARD-3. CARD-4. CARD-5. or CARD-6 protein. 

The invention also features methods for identifying a compound 
which modulates the expression of CARD-3, CARD-4, CARD-5, or CARD-6 by 
measuring the expression of CARD-3. CARD-4, CARD-5. or CARD-6 in the 

3 0 presence and absence of a compound. 
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Other features and advantages of the invention will be apparent from 
the following detailed description and claims. 

Brief Description of the Drawings 
5 Figure 1 depicts the cDNA sequence (SEQ ID NO: 1 ) of human 

CARD-3. The open reading frame of CARD-3 (SEQ ID NO:l ) extends from 
nucleotide 213 to nucleotide 1833 nucleotide (SEQ ID NO:3). 

Figure 2 depicts the predicted amino acid sequence (SEQ ID NO:2) of 
human CARD-3. 

1 0 Figure 3 depicts the cDNA sequence (SEQ ID NO:7) of CARD-4L. 

The open reading frame of SEQ ID NO:7 extends from nucleotide 245 to 
nucleotide 3 1 03 (SEQ ID NO:9). 

Figure 4 depicts the predicted amino acid sequence (SEQ ID NO:8) of 
human CARD-4L. 

15 Figure 5 depicts the partial cDNA sequence (SEQ ID NO:25) of 

CARD-4S and the predicted amino acid sequence (SEQ ID NO:25) of human 
CARD-4S. The open reading frame of CARD-4 (SEQ ID NO:25) extends from 
nucleotide 1 to nucleotide 1470 (SEQ ID NO:27). 

Figure 6 depicts the predicted amino acid sequence (SEQ ID NO:26) 
20 of human CARD-4S. 

Figure 7 depicts an alignment of the CARD domains of CARD-4 
(SEQ ID NO: 1 0), CARD-3 (SEQ ID NO:6). ARC-CARD (SEQ ID NO:3 1), 
cIAPl-CARD (SEQ ID NO:32), and cIAP2-CARD (SEQ ID NO:33). 

Figure 8 is a plot showing predicted structural features of human 

2 5 CARD-4L. 

Figure 9 is a plot showing predicted structural features of human 

CARD-4S. 

Figure 10 depicts the cDNA sequence (SEQ ID NO:38) of the human 
CARD-4Y splice variant clone. The predicted open reading frame of the human 

3 0 CARD-4Y splice variant clone extends from nucleotide 438 to nucleotide 1 1 84. 
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Figure 1 1 depicts the amino acid sequence (SEQ ID NO: 3 9) of the 
protein predicted to be encoded by the human CARD-4Y cDNA open reading 
frame. 

Figure 12 depicts the cDNA sequence (SEQ ID NO:40) of the human 
5 CARD-4Z splice variant clone. The predicted open reading frame of the human 
CARJD-4Z splice variant clone extends from nucleotide 489 to nucleotide 980. 

Figure 13 depicts the amino acid sequence (SEQ ID NO:41) of the 
protein predicted to be encoded by the human CARD-4Z cDNA open reading 
frame. 

1 0 Figure 14 depicts an alignment of human CARD-4L (SEQ ID NO:8). 

the predicted amino acid sequence of human CARD-4Y (SEQ ID NO:39), and 
the predicted amino acid sequence of human CARD-4Z ( SEQ ID NO:41). 

Figure 1 5 depicts the nucleotide sequence of the murine CARD-4L 
cDNA (SEQ ID NO:42). 
15 Figure 16 depicts the predicted amino acid sequence of murine 

CARD-4L (SEQ ID NO:43). 

Figure 17 depicts an alignment of human CARD-4L (SEQ ID NO:8) 
and the predicted amino acid sequence of murine CARD-4L (SEQ ID NO:43). 

Figure 18 depicts a 32042 nucleotide genomic sequence of CARD-4. 
20 Figure 19 depicts the nucleotide sequence of a murine CARD-5 

cDNA (SEQ ID NO:60). The open reading frame of this cDNA extends from 
nucleotide 89 to nucleotide 667 of SEQ ID NO:60 (SEQ ID NO:62) and encodes 
a 193 amino acid protein (SEQ ID NO:61). 

Figure 20 depicts a hydropathy plot of murine CARD-5. Relatively 

2 5 hydrophobic residues are above the dashed horizontal line, and relatively 

hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

Figure 2 1 depicts the nucleotide sequence of a human CARD-5 cDNA 

3 0 (SEQ ID NO:48). The open reading frame of this cDNA extends from nucleotide 
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53 to nucleotide 638 of SEQ ID NO:48 (SEQ ID NO:50) and encodes a 195 
amino acid protein SEQ ID NO:49). 

Figure 22 depicts a hydropathy plot of human CARD-5. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
5 hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

Figure 23 depicts an alignment of the cDNA sequences of murine 
CARD-5 (SEQ ID NO:60) and human CARD-5 (SEQ ID NO:48). This 
1 C alignment was created using ALIGN (version 2.0; PAM120 scoring matrix; -12/- 
4 gap penalty). In this alignment the sequences are 68.2% identical. 

Figure 24 depicts an alignment of the amino acid sequences of murine 
CARD-5 (SEQ ID NO:61) and human CARD-5 (SEQ ID NO:49). This 
alignment was created using ALIGN (version 2.0; PAM120 scoring matrix; -12/- 
15 4 gap penalty). In this alignment the sequences are 71.8% identical. 

Figure 25 depicts the nucleotide sequence of a rat CARD-6 cDNA 
(SEQ ID NO:5 1 ). The open reading frame of this cDNA extends from nucleotide 
169 to nucleotide 2883 of SEQ ID NO:51 (SEQ ID NO:53) and encodes a 505 
amino acid protein (SEQ ID NO:52). 
2 0 Figure 26 depicts a hydropathy plot of rat CARD-6. Relatively 

hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

2 5 Figure 27 depicts an alignment of the CARD domains of murine 

CARD-5 (SEQ ID NO:57). human CARD-5 (SEQ ID NO:58). and RAIDD (SEQ 
IDNO:6l). 

Figure 28 depicts the nucleotide sequence of a human CARD-6 cDNA 
(SEQ ID NO:54). The open reading frame of this cDNA extends from nucleotide 

3 0 200 to 3310 of SEQ IDNO.54 (SEQ IDNO.56) and encodes a 1037 amino acid 
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protein (SEQ ID NO:55). 

Figure 29 depicts a hydropathy plot of human CARD-6. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues are below the dashed horizontal line. 
5 Figure 30 depicts an alignment of the CARD domain of human 

CARD-6 (SEQ ID NO:64) with a consensus CARD domain (SEQ ID NO:67). In 
this depiction of the consensus sequence, more conserved residues are indicated 
by uppercase letters and less conserved residues are indicated by lowercase 
letters. 

1 0 Figure 3 1 depicts an alignment of the CARD domains of human 

CARD-3. human CARD-4, human CARD-5, murine CARD-5, human CARD-6. 
and rat CARD-6. This alignment was created using the CJustal method with 
PAM250 residue weight table. A consensus sequence is also depicted (SEQ ID 
NO: ). 

15 

Detailed Description of the Invention 

The present invention is based, in part, on the discovery of cDNA 
molecules encoding human CARD-3, human CARD-4, partial murine CARD-4L, 
murine CARD-5, human CARD-5, rat CARD-6. and human CARD-6 proteins. 

20 
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TABLE 1 



Summary of CARD-3, CARD-4, CARD-5. and CARD-6 Sequence Information. 

5 



Gene 


cDNA 


Protein 


ORF 


Figure 


Accession 
Number 


human 
CARD-3 


SEQ ID 
NO:l 


SEQ ID 
NO:2 


SEQ ID 
NO:3 


Figs. 
1-2 


203037 


human 
CARD-4L 


SEQ ID 
NO: 7 


SEQ ID 
NO:8 


SEQ ID 

NO:9 


Figs. 
3-4 


203035 


human 
CARD-4S 


SEQ ID 
NO-.25 


SEQ ID 
NO:26 


SEQ ID 
NO:27 


Figs. 
5-6 


203036 


human 
CARD-4Y 


SEQ ID 
NO:38 


SEQ ID 
NO:39 




Figs. 
10-1 1 




human 
CARD-4Z 


SEQ ID 
NO:40 


SEQ ID 
NO:41 




Figs. 
12-13 




murine 
CARD-4L 


SEQ ID 
NO:42 


SEQ ID 

NO:43 




Figs. 
15-16 




human 
CARD-5 


SEQ ID 
NO:48 


SEQ ID 

NO:49 


SEQ ID 

NO:50 


Fig. 21 


PTA-213 


murine 
CARD-5 


SEQ ID 
N'O:60 


SEQ ID 
NO:61 


SEQ ID 
NO:62 


Fig. 19 


PTA-2 1 2 


human 
CARD-6 


SEQ ID 
NO:54 


SEQ ID 
NO:55 


SEQ ID 

NO:56 


Fig. 28 


PTA-2 1 3 


rat 

CARD-6 


SEQ ID 
NO:51 


SEQ ID 
NO:52 


SEQ ID 

NO:53 


Fig. 25 


PTA-2 1 1 



A nucleotide sequence encoding a human CARD-3 protein is shown 
in Figure 1 (SEQ ID NO: 1 ; SEQ ID NO:3 includes the open reading frame only). 
A predicted amino acid sequence of CARD-3 protein is also shown in Figure 2 

10 (SEQIDNO:2). 

CARD-4 has at least two forms, a long form, CARD-4L ? and a short 
form, CARD-4S, as well as two or more splice variants. A nucleotide sequence 
encoding a human CARD-4L protein is shown in Figure 3 (SEQ ID NO:7; SEQ 
ID NO:9 includes the open reading frame only). A predicted amino acid 

i 5 sequence of CARD-4L protein is also shown in Figure 4 (SEQ ID NO:8). A 
nucleotide sequence encoding a human CARD-4S protein is shown in Figure 5 
(SEQ ID NO:25; SEQ ID NO:27 includes the open reading frame only). A 
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predicted amino acid sequence of CARD-4S protein is shown in Figure 6 (SEQ 
ID NO:26). Two additional splice variants of human CARD-4 are provided in 
Figures 10 and 11 (human CARD-4Y) and Figures 12 and 13 (human CARD-4Z) 
(predicted amino acid sequences: SEQ ID NO:39 and SEQ ID NO:41 and nucleic 
5 acid sequences: SEQ ID NO:38 and SEQ ID NO:40). These two splice variants 
are predicted to contain 249 and 1 64 amino acids, respectively. An alignment of 
human CARD-4Y, human CARD-4Z and human CARD-4L is shown in Figure 
14. 

In addition to the human CARD-4 proteins, a full length nucleotide 
10 sequence of the murine ortholog of human CARD-4L is provided in Figure 1 5 
(SEQ ID NO:42). An alignment of murine CARD-4L with human CARD-4L is 
shown in Figure 1 7. 

A nucleotide sequence encoding a murine CARD-5 protein is shown 
in Figure 19 (SEQ ID NO:60; SEQ ID NO:62 includes the open reading frame 
15 only). A predicted amino acid sequence of murine CARD-5 protein is also 
shown in Figure 19 (SEQ ID NO:61). 

A nucleotide sequence encoding a human CARD-5 protein is shown 
in Figure 21 (SEQ ID NO:48; SEQ ID NO:50 includes the open reading frame 
only). A predicted amino acid sequence of human CARD-5 protein is also shown 
2 0 in Figure 2 1 (SEQ ID NO:49). 

A nucleotide sequence encoding a rat CARD-6 protein is shown in 
Figure 25 (SEQ ID NO:51; SEQ IDNO:53 includes the open reading frame 
only). A predicted amino acid sequence of rat CARD-6 protein is also shown in 
Figure 25 (SEQ IDNO:52). 

2 5 The human CARD-3 cDNA of Figure 1 (SEQ ID NO: 1 ). which is 

approximately 193 1 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 61 kDa (excluding post- 
translational modifications). 

A plasmid containing a cDNA encoding human CARD-3 (pXE17A) 

3 0 was deposited with the American Type Culture Collection ( ATCC), Manasass. 
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VA on May 14. 1998, and assigned Accession Number 203037. This deposit will 
be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in 
5 the art and is not an admission that a deposit is required under 35 U.S.C.§ 1 12. 

The human CARD-4L cDNA of Figure 3 (SEQ ID NO:7), which is 
approximately 3382 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 1 08 kDa (excluding post- 
translational modifications). 

10 A plasmid containing a cDNA encoding human CARD-4L (pC4Ll ) 

was deposited with the American Type Culture Collection (ATCC), Manasass, 
VA on July 7, 1998, and assigned Accession Number 203035. This deposit will 
be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 

1 5 Procedure. This deposit was made merely as a convenience for those of skill in 
the art and is not an admission that a deposit is required under 35 U.S.C.§1 12. 

The human CARD-4S cDNA of Figure 5 (SEQ ID NO:25). which is 
3082 nucleotides long including untranslated regions. 

A plasmid containing a cDNA encoding human CARD-4S (pDB33E) 

2 0 was deposited with the American Type Culture Collection (ATCC). Manasass. 
VA on May 14. 1998. and assigned Accession Number 203036. This deposit will 
be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in 

2 5 the art and is not an admission that a deposit is required under 35 U.S.C.§1 12. 

The human CARD-5 cDNA of Figure 21 (SEQ ID NO:48). which is 
approximately 740 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 21.6 kD. 

A plasmid containing a cDNA encoding human CARD-5 

3 0 (EpHC5) was deposited with the American Type Culture Collection (ATCC). 
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Manasass. VA on June 1 1, 1999. and assigned Accession Number PTA-213. 
This deposit will be maintained under the terms of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. This deposit was made merely as a convenience for those of 
5 skill in the art and is not an admission that a deposit is required under 35 
U.S.C.§112. 

The murine CARD-5 cDNA of Figure 19 (SEQ ID NO:60). which is 
approximately 778 nucleotides long, including untranslated regions, encodes a 
protein having a molecular weight of approximately 21 .5 kD. 

10 A plasmid containing a cDNA encoding murine CARD-5 (EpMC5) 

was deposited with the American Type Culture Collection (ATCC). Manassas, 
VA on June 11,1 999, and assigned Accession Number PTA-212. This deposit 
will be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 

15 Procedure. This deposit was made merely as a convenience for those of skill in 
the art and is not an admission that a deposit is required under 35 U.S.C.§1 12. 

The human CARD-6 cDNA of Figure 28 (SEQ ID NO:54). which is 
approximately 4244 nucleotides long encodes a protein having a molecular 
weight of approximately 1 16.5 kD (excluding post-translational modifications). 

2 0 A plasmid containing a cDNA encoding an amino terminal portion of 

human CARD-6 (EpHC6e), a plasmid encoding a carboxy terminal portion of 
human CARD-6 (EpHC6c), and a plasmid containing cDNA encoding human 
CARD-6 (EpHC6) were deposited with the American Type Culture Collection 
(ATCC), Manasass, VA on June 1 1 , 1999. and assigned Accession Number PTA- 

2 5 213. This deposit will be maintained under the terms of the Budapest Treaty on 
the International Recognition of the Deposit of Microorganisms for the Purposes 
of Patent Procedure. This deposit was made merely as a convenience for those of 
skill in the art and is not an admission that a deposit is required under 35 
U.S.C,§112. 
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The rat CARD-6 cDNA of Figure 25 (SEQ ID NO: 51), which is 
approximately 5252 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 100.7 kD. 

A plasmid containing a cDNA encoding rat CARD-6 (EpMR5) was 
5 deposited with the American Type Culture Collection (ATCC), Manassas, VA. 
on June 10, 1999, and assigned Accession Number PTA-21 1. This deposit will 
be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in 
10 the art and is not an admission that a deposit is required under 35 U.S.C., 112. 

A region of human CARD-4L protein (SEQ ID NO:8). the CARD 
domain (SEQ ID NO: 10), bears some similarity to a CARD domain of CARD-3 
(SEQ ID NO:6), ARC-CARD (SEQ IDNO:31), cIAPl-CARD (SEQ ID NO:32), 
and cIAP2-CARD (SEQ ID NO:33). This comparison is depicted in Figure 7. 
15 A region, the CARD domain (SEQ ID NO:58), of human CARD-5 

protein (SEQ ID NO:48) and a region, the CARD domain (SEQ ID NO:57), of 
murine CARD-5 protein (SEQ ID NO:61) bear some similarity to the CARD of 
RAIDD (SEQ IDNO:70). This comparison is depicted in Figure 27. 

Each of CARD-3, CARD-4, CARD-5, and CARD-6 are members of a 
2 0 family of molecules (the "CARD-3 family", the "CARD-4 family", the "CARD-5 
family", and the "CARD-6 family'' respectively) having certain conserved 
structural and functional features. The term "family" when referring to the 
protein and nucleic acid molecules of the invention is intended to mean two or 
more proteins or nucleic acid molecules having a common structural domain and 

2 5 having sufficient amino acid or nucleotide sequence identity as defined herein. 

Such family members can be naturally occurring and can be from either the same 
or different species. For example, a family can contain a first protein of human 
origin and a homologue of that protein of murine origin, as well as a second, 
distinct protein of human origin and a murine homologue of that protein. 

3 0 Members of a family may also have common functional characteristics. 
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In one embodiment, a CARD-3, CARD-4, CARD-5, or CARD-6 
protein includes a CARD domain having at least about 65%, preferably at least 
about 75%, and more preferably about 85%, 95%, or 98% amino acid sequence 
identity to the CARD domain of SEQ ID NO:6 or the CARD domain of SEQ ID 
5 NO: 1 0 or the CARD domain of SEQ ID NO:28. the CARD domain of SEQ ID 
NO:57, the CARD domain of SEQ ID NO:58, the CARD domain of SEQ ID 
NO:S9, or the CARD domain of SEQ ID NO:64. 

Preferred CARD-3. CARD-4, CARD-5, or CARD-6 polypeptides of 
the present invention have an amino acid sequence sufficiently identical to the 

1 0 CARD domain amino acid sequence of SEQ ID NO:6. SEQ ID NO: 1 0, SEQ ID 
NO:57. SEQ ID NO:58, SEQ ID NO:59, and SEQ ID NO:64. respectively. 

The CARD-3 polypeptide also has an amino acid sequence 
sufficiently identical to the kinase domain sequence of SEQ ID NO:4, and an 
amino acid sequence that is sufficiently identical to the linker domain of SEQ ID 

15 NO:5. The CARD-4L polypeptide has an amino acid sequence sufficiently 
identical to the nucleotide binding domain of SEQ ID NO: 1 L an amino acid 
sequence sufficiently identical to the Walker Box "A" of SEQ ID NO: 12 or 
Walker Box "B" of SEQ ID NO: 13, an amino acid sequence sufficiently identical 
to the kinase la subdomain of SEQ ID NO:46, an amino acid sequence 

2 0 sufficiently identical to the kinase 2 subdomain of SEQ ID NO:47, or an amino 
acid sequence sufficiently identical to the kinase 3a subdomain of SEQ ID 
NO: 14. or an amino acid sequence sufficiently identical to the Leucine-rich 
repeats of SEQ ID NO: 15, SEQ ID NO: 16. SEQ ID NO: 17, SEQ ID NO: 18. SEQ 
ID NO:19. SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22. SEQ ID NO:23, and 

2 5 SEQIDNO:24. 

As used herein, the term "sufficiently identical" refers to a first amino 
acid or nucleotide sequence which contains a sufficient or minimum number of 
identical or equivalent (e.g., an amino acid residue which has a similar side 
chain) amino acid residues or nucleotides to a second amino acid or nucleotide 

3 0 sequence such that the first and second amino acid or nucleotide sequences have 
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a common structural domain and/or common functional activity. For example, 
amino acid or nucleotide sequences which contain a common structural domain 
having about 65% identity, preferably 75% identity, more preferably 85%, 95%, 
or 98% identity are defined herein as sufficiently identical. 
5 As used interchangeably herein a "CARD-3, CARD-4, CARD-5, or 

CARD-6 activity", "biological activity of CARD-3, CARD-4, CARD-5. or 
CARD-6" or "functional activity of CARD-3, CARD-4, CARD-5, or CARD-6". 
refers to an activity exerted by a CARD-3, CARD-4. CARD-5. or CARD-6 
protein, polypeptide or nucleic acid molecule on a CARD-3. CARD-4. CARD-5. 

10 or CARD-6 responsive cell as determined in vivo, or in vitro, according to 

standard techniques. A CARD-3, CARD-4, CARD-5, or CARD-6 activity can be 
a direct activity, such as an association with or an enzymatic activity on a second 
protein or an indirect activity, such as a cellular signaling activity mediated by 
interaction of the CARD-3. CARD-4, CARD-5, or CARD-6 protein with a 

15 second protein. In one embodiment, a CARD-3. CARD-4, CARD-5, or CARD-6 
activity includes at least one or more of the following activities: (i) the ability to 
interact with proteins in an apoptotic signaling pathway; (ii) the ability to interact 
with a CARD-3. CARD-4, CARD-5. or CARD-6 ligand; or (iii) the ability to 
interact with an intracellular target protein: (iv) the ability to interact, directly or 

2 0 indirectly with one or more with caspases; (v) the ability to modulate the activity 
of a caspase, e.g.. caspase-9; (vi) the ability to modulate the activity of NF-kB: 
(vii) the ability to modulate Apaf-1; (viii) the ability to modulate a Bcl-2 family 
member; (ix) the ability to modulate a neurotropin receptor, e.g., P75 NTR ; (x) the 
ability to modulate the activity of a stress activated kinase (e.g.. JNK/p38); and 

2 5 (xi) the ability to modulate phosphorylation of CHOP (GADD 1 53). For 

example, in Example 4. CARD-3 -containing proteins were shown to associate 
with CARD-4-containing proteins. In example 9, CARD-4 proteins were shown 
to induce NF-icB-mediated transcription. In example 10. CARD-3 and CARD-4 
were shown to enhance caspase-9 activity. 

30 

-34- 

BNSDOCID: <WO O100826A2J_> 



WO 01/00826 



PCT/USO0/17691 



CARD-4 and CARD-6 have Apaf-l-like sequences and may bind to 
one or more members of the Bcl-2 family (e.g., Bcl-2, Boo, or Diva). CARD-3 
and CARD-5 may also bind to one or more members fo the Bcl-2 family. 
CARD-3, CARD-4, CARD-5, and CARD-6 may modulate apoptosis by 
5 influencing the activity of a Bcl-2 family member, which modulation, in turn, 
modulates activity of Apaf-1 or other factors. CARD-3, CARD-4, CARD-5, and 
CARD-6 nucleic acid and polypetpides as well as modulators of activity of 
expression of CARD-3, CARD-4, CARD-5. or CARD-6 can be used to modulate 
an Apaf-1 signaling pathway. 

1 o CARD-3 and CARD-4 may bind to a neurolrophin receptor (e.g., 

p75 NTR ). CARD-3 and CARD-4 may modulate the activity of a neurotrophin 
receptor and thus modulate apoptosis of neuronal cells. Accordingly, CARD-3 
and CARD-4 nucleic acids and polypeptides as well as modulators of CARD-3 or 
CARD-4 activity or expression can be used to modulate apoptosis of neurons 
15 (e.g., for treatment of neurological disorders, particularly neurodegenerative 
disorders). 

Accordingly, another embodiment of the invention features isolated 
CARD-3, CARD-4. CARD-5, or CARD-6 proteins and polypeptides having a 
CARD-3. CARD-4. CARD-5. or CARD-6 activity. 

2 0 Various aspects of the invention are described in further detail in the 

following subsections. 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules 

2 5 that encode CARD-3. CARD-4, CARD-5, or CARD-6 proteins or biologically 
active portions thereof, as well as nucleic acid molecules sufficient for use as 
hybridization probes to identify CARD-3, CARD-4, CARD-5. or CARD-6- 
encoding nucleic acids (e.g., CARD-3, CARD-4. CARD-5. or CARD-6 mRNA) 
and fragments for use as PCR primers for the amplification or mutation of 

' 3 0 CARD-3. CARD-4, CARD-5, or CARD-6 nucleic acid molecules. As used 
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herein, the term "nucleic acid molecule" is intended to include DNA molecules 
(e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of 
the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule 
can be single-stranded or double-stranded, but preferably is double-stranded 
5 DNA. 

An "isolated" nucleic acid molecule is one which is separated from 
other nucleic acid molecules which are present in the natural source of the nucleic 
acid. Preferably, an "isolated" nucleic acid is free of sequences (preferably 
protein encoding sequences) which naturally flank the nucleic acid (i.e., 

1 0 sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA 
of the organism from which the nucleic acid is derived. For example, in various 
embodiments, the isolated CARD-3, CARD-4, CARD-5. or CARD-6 nucleic acid 
molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb. 0.5 kb or 0.1 kb 
of nucleotide sequences which naturally flank the nucleic acid molecule in 

15 genomic DNA of the cell from which the nucleic acid is derived. Moreover, an 
"isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially 
free of other cellular material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors or other chemicals when 
chemically synthesized. 

20 A nucleic acid molecule of the present invention, e.g.. a nucleic acid 

molecule having the nucleotide sequence of SEQ ID NO:l . SEQ ID NO:3. SEQ 
ID NO:7, SEQ ID NO:9. SEQ ID:25. SEQ ID NO:27. SEQ ID NO:38, SEQ ID 
NO:40. SEQ ID NO:42. SEQ ID NO:48. SEQ ID NO:50. SEQ ID NO:51. SEQ 
ID NO:53. SEQ ID NO:54. SEQ ID NO:56. SEQ ID NO:60, SEQ ID NO:62. the 

25 cDNA of ATCC 203037. the cDNA of ATCC 203035. the cDNA of ATCC 
203036. the cDNA of ATCC PTA-21 1, the cDNA of ATCC PTA-212, or the 
cDNA of ATCC PTA-21 3, or a complement of any of these nucleotide 
sequences, can be isolated using standard molecular biology techniques and the 
sequence information provided herein. Using all or portion of the nucleic acid 

3 0 sequences of SEQ ID NO:l, SEQ ID NO:3. SEQ ID NO:7, SEQ ID NO:9. SEQ 
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ID:25, SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
NO:48, SEQ ID NO:50. SEQ ID N0:51, SEQ ID NO:53. SEQ ID NO:54. SEQ 
ID NO:56, SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 203037 the 
cDNA of ATCC 203035, the cDNA of ATCC 203036, the cDNA of ATCC PTA- 
5 21 1, the cDNA of ATCC PTA-212, or the cDNA of PTA-213, as a hybridization 
probe, CARD-3, CARD-4, CARD-5, or CARD-6 nucleic acid molecules can be 
isolated using standard hybridization and cloning techniques (e.g., as described in 
Sambrook et al., eds.. Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring 

10 Harbor. NY. 1989). 

A nucleic acid of the invention can be amplified using cDNA, mRNA 
or genomic DNA as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so 
amplified can be cloned into an appropriate vector and characterized by DNA 

15 sequence analysis. Furthermore, oligonucleotides corresponding to CARD-3, 
CARD-4, CARD-5, or CARD-6 nucleotide sequences can be prepared by 
standard synthetic techniques, e.g., using an automated DNA synthesizer. 

In another embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the 

2 0 nucleotide sequence shown in SEQ ID NO: 1 . SEQ ID NO:3. SEQ ID NO:7, SEQ 
ID NO:9. SEQ ID:25. SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO:40. SEQ ID 
NO:42. SEQ ID NO:48, SEQ ID NO:50. SEQ ID NO:51, SEQ ID NO:53, SEQ 
ID NO:54. SEQ ID NO:56, SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 
203037, the cDNA of ATCC 203035, or the cDNA of ATCC 203036, or the 

2 5 cDNA of ATCC PTA-21 1, the cDNA of PTA-212. or the cDNA of PTA-213, or 

a portion thereof. A nucleic acid molecule which is complementary to a given 
nucleotide sequence is one which is sufficiently complementary to the given 
nucleotide sequence that it can hybridize to the given nucleotide sequence 
thereby forming a stable duplex. 

3 0 Moreover, the nucleic acid molecule of the invention can comprise 
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only a portion of a nucleic acid sequence encoding CARD-3, CARD-4, CARD-5, 
or CARD-6. for example, a fragment which can be used as a probe or primer or a 
fragment encoding a biologically active portion of CARD-3, CARD-4, CARD-5, 
or CARD-6. The nucleotide sequence determined from the cloning of the human 
5 CARD-3, CARD-4, CARD-5, or CARD-6, and the partial murine CARD-4 gene 
allows for the generation of probes and primers designed for use in identifying 
and/or cloning CARD-3. CARD-4. CARD-5. or CARD-6 homologues in other 
cell types, e.g., from other tissues, as well as CARD-3. CARD-4. CARD-5. or 
CARD-6 homologues and orthologs from other mammals. The probe/primer 

10 typically comprises substantially purified oligonucleotide. The oligonucleotide 
typically comprises a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably about 25, more preferably 
about 50. 75, 100, 125. 150, 175. 200, 250, 300, 350 or 400 consecutive 
nucleotides of the sense or anti-sense sequence of SEQ ID NO: 1 , SEQ ID NO:3, 

1 5 SEQ ID NO:7, SEQ ID NO:9, SEQ ID:25, SEQ ID NO:27, SEQ ID NO:38, SEQ 
ID NO:40, SEQ ID NO:42. SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:5 1 , 
SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:60. SEQ ID 
NO:62. the cDNA of ATCC 203037. the cDNA of ATCC 203035. or the cDNA 
of ATCC 203036, or the cDNA of ATCC PTA-21 1. the cDNA of PTA-212. or 

2 0 the cDNA of PTA-2 1 3. or of a naturally occurring mutant of one of SEQ ID 

NO:l, SEQ ID NO:3, SEQ ID NO:7. SEQ ID NO:9. SEQ ID:25, SEQ ID NO:27. 
SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42. SEQ ID NO:48, SEQ ID 
NO:50, SEQ ID NO:51, SEQ ID N0.53. SEQ ID N0.54. SEQ ID NO:56. SEQ 
ID NO:60, SEQ ID NO:62, the cDNA of ATCC 203037. the cDNA of ATCC 

2 5 203035. the cDNA of ATCC 203036. or the cDNA of ATCC PTA-21 1. the 

cDNA of PTA-212, or the cDNA of PTA-21 3. 

Probes based on the CARD-3. CARD-4. CARD-5, or CARD-6 
nucleotide sequence can be used to detect transcripts or genomic sequences 
encoding the same or similar proteins. The probe comprises a label group 

3 0 attached thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an 
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enzyme co-factor. Such probes can be used as a part of a diagnostic test kit for 
identifying allelic variants and orthologs of the CARD-3, CARD-4, CARD-5, or 
CARD-6 proteins of the present invention, identifying cells or tissue which mis- 
express a CARD-3, CARD-4, CARD-5, or CARD-6 protein, such as by 
5 measuring a level of a CARD-3, CARD-4, CARD-5, or CARD-6-encoding 
nucleic acid in a sample of cells from a subject, e.g., detecting CARD-3, CARD- 
4, CARD-5, or CARD-6 mRNA levels or determining whether a genomic 
CARD-3. CARD-4, CARD-5, or CARD-6 gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically active portion" of 

1 0 CARD-3, CARD-4, CARD-5. or CARD-6 can be prepared by isolating a portion 
of SEQ ID NO: 1. SEQ ID NO:3. SEQ ID NO:7, SEQ ID NO:9. SEQ ID:25, SEQ 
ID NO:27, SEQ ID NO:38. SEQ ID NO:40. SEQ ID NO:42. SEQ ID NO:48, 
SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:53. SEQ ID NO:54. SEQ ID 
NO:56, SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 203037. the cDNA 

15 of ATCC 203035, the cDNA of ATCC 203036. or the cDNA of ATCC PTA-21 1, 
the cDNA of PTA-21 2, or the cDNA of PTA-213, which encodes a polypeptide 
having a CARD-3. CARD-4, CARD-5, or CARD-6 biological activity, 
expressing the encoded portion of CARD-3. CARD-4, CARD-5, or CARD-6 
protein (e.g., by recombinant expression in vitro) and assessing the activity of the 

2 0 encoded portion of CARD-3, CARD-4, CARD-5, or CARD-6. For example, a 
nucleic acid fragment encoding a biologically active portion of CARD-3. CARD- 
4, CARD-5, or CARD-6 includes a CARD domain, e.g., SEQ ID NO:6. SEQ ID 
NO: 1 0. SEQ ID NO:28. SEQ ID NO:57. SEQ ID NO:58. SEQ ID NO:59. or 
SEQ ID NO:62. 

2 5 The invention further encompasses nucleic acid molecules that differ 

from the nucleotide sequence of SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:7. 
SEQ ID NO:9, SEQ ID:25. SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40. 
SEQ ID NO:42. SEQ ID NO:48. SEQ ID NO:50. SEQ ID NO:5 1 . SEQ ID 
NO:53. SEQ ID NO:54. SEQ ID NO:56. SEQ ID NO:60. SEQ ID NO:62, the 

3 0 cDNA of ATCC 203037. the cDNA of ATCC 203035. the cDNA of ATCC 
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203036, or the cDNA of ATCC PTA-21 1, the cDNA of PTA-212, or the cDNA 
of PTA-213, due to degeneracy of the genetic code and thus encode the same 
CARD-3, CARD-4. CARD-5, or CARD-6 protein as that encoded by the 
nucleotide sequence shown in SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:7. SEQ 

5 ID NO:9. SEQ ID:25, SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO:40, SEQ ID 
NO:42, SEQ ID NO:48, SEQ ID NO:50. SEQ ID NO:5 1 . SEQ ID NO:53, SEQ 
ID NO:54, SEQ ID NO:56, SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 

203037. the cDNA of ATCC 203035, the cDNA of ATCC 203036. or the cDNA 
of ATCC PTA-21 1, the cDNA of ATCC PTA-212, or the cDNA of ATCC PTA- 

10 213. 

In addition to the CARD-3. CARD-4. CARD-5. or CARD-6 
nucleotide sequence shown in SEQ ID NOT. SEQ ID NO:3, SEQ ID NO:7. SEQ 
ID NO:9. SEQ ID:25, SEQ ID NO:27. SEQ ID NO:38, SEQ ID NO.40. SEQ ID 
NO:42. SEQ ID NO:48. SEQ ID NO:50, SEQ ID NO:51. SEQ ID NO:53, SEQ 

15 ID NO:54, SEQ ID NO:56, SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 
203037. the cDNA of ATCC 203035, the cDNA of ATCC 203036. the cDNA of 
ATCC PTA-21 1. the cDNA of ATCC PTA-212. or the cDNA of ATCC PTA- 
21 3. ii will be appreciated by those skilled in the art that DNA sequence 
polymorphisms that lead to changes in the amino acid sequences of CARD-3. 

20 CARD-4, CARD-5, or CARD-6 may exist within a population (e.g.. the human 
population). Such genetic polymorphism in the CARD-3, CARD-4, CARD-5, or 
CARD-6 gene may exist among individuals within a population due to natural 
allelic variation. As used herein, the terms "gene" and "recombinant gene" refer 
to nucleic acid molecules comprising an open reading frame encoding a CARD- 

2 5 3. CARD-4, CARD-5. or CARD-6 protein, preferably a mammalian CARD-3, 
CARD-4. CARD-5. or CARD-6 protein. Such natural allelic variations can 
typically result in 1-5% variance in the nucleotide sequence of the CARD-3. 
CARD-4. CARD-5. or CARD-6 gene. Any and all such nucleotide variations 
and resulting amino acid polymorphisms in CARD-3. CARD-4. CARD-5. or 

30 CARD-6 that are the result of natural allelic variation and that do not alter the 
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functional activity of CARD-3, CARD-4, CARD-5, or CARD-6 are intended to 
be within the scope of the invention. Thus, e.g., 1%, 2%, 3%, 4%, or 5% of the 
amino acids in CARD-3, CARD-4, CARD-5, or CARD-6 are replaced by another 
amino acid, preferably the amino acids are replaced by conservative substitutions. 
5 Moreover, nucleic acid molecules encoding CARD-3, CARD-4, 

CARD-5, or CARD-6 proteins from other species (CARD-3, CARD-4. CARD-5. 
or CARD-6 orthologs/homologues), which have a nucleotide sequence which 
differs from that of a CARD-3, CARD-4, CARD-5. or CARD-6 disclosed herein, 
are intended to be within the scope of the invention. 

1 0 For example, Example 5 describes the murine CARD-4 ortholog and 

Example 1 4 describes the murine CARD-5 ortholog. Nucleic acid molecules 
corresponding to natural allelic variants and homologues of the CARD-3, CARD- 
4, CARD-5, or CARD-6 cDNA of the invention can be isolated based on their 
similarity to the nucleic acids disclosed herein using the human or murine 

1 5 cDNAs, or a portion thereof, as a hybridization probe according to standard 
hybridization techniques under stringent hybridization conditions. 

In general, an allelic variant of a gene will be readily identifiable as 
mapping to the same chromosomal location as said gene. For example, in 
Example 6, the chromosomal location of the human CARD-4 gene is discovered 

2 0 to be chromosome 7 close to the SHGC-31928 genetic marker. Allelic variants 
of human CARD-4 will be readily identifiable as mapping to the human CARD-4 
locus on chromosome 7 near genetic marker SHGC-31928. 

Accordingly, in another embodiment, an isolated nucleic acid 
molecule of the invention is at least 300 (325. 350, 375, 400. 425. 450. 500, 550, 

2 5 600. 650. 700, 800, 900, 1000, 1300, 1600 or 1931) nucleotides in length and 

hybridizes under stringent conditions to the nucleic acid molecule comprising the 
nucleotide sequence, preferably the coding sequence, of SEQ ID NO:l, SEQ ID 
NO:3. or the cDNA of ATCC 203037. In yet another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 (325. 350. 375. 400. 425. 

3 0 450. 500, 550. 600. 650. 700. 800. 900. 1000. or 1300. 1640. 1900. 2200. 2500. 
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2800, 3100, or 3382) nucleotides in length and hybridizes under stringent 
conditions to the nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO:40, the cDNA of ATCC 203035, or 
5 the cDNA of ATCC 203036. Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 (325, 350, 375, 400, 425, 
450, 500, 550, 600, 650, 700, 800, 900, 1000, or 1300, 1640, 1900. 2200, 2500, 
2800, 3100, 3300, 3600, 3900. 4200 or 4209) nucleotides in length and 
hybridizes under stringent conditions to the nucleic acid molecule comprising the 

10 nucleotide sequence, preferably the coding sequence, of SEQ ID NO:42. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350, 400. 450. 500, 550. 600, 650, 700, or 740) 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule consisting of the nucleotide sequence of SEQ ID NO:48 or SEQ ID 

15 NO:50. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350, 400, 450, 500. 550. 600. 650. 700. or 761) 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule consisting of the nucleotide sequence of SEQ ID NO:60 or SEQ ID 
2 0 NO:62. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350, 400. 450, 500, 550, 600, 650. 700, 1000. 1500. 
2000. 2500. 3000, 3500. 4000, 4500, 5000. 5200, or 5252) nucleotides in length 
and hybridizes under stringent conditions to a nucleic acid molecule consisting of 

2 5 the nucleotide sequence of SEQ ID NO:51 or SEQ ID NO: 53. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350, 400, 450, 500, 550, 600, 650, 700, 1000, 1500, 
2000, 2500. 3000, 3500, 4000, 4500. or 5000) nucleotides in length and 
hybridizes under stringent conditions to a nucleic acid molecule consisting of the 

3 0 nucleotide sequence of SEQ ID NO:54 or SEQ ID NO:56. 
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As used herein, the term "hybridizes under stringent conditions" is 
intended to describe conditions for hybridization and washing under which 
nucleotide sequences at least 60% (65%, 70%, preferably 75%) identical to each 
other typically remain hybridized to each other. Such stringent conditions are 
5 known to those skilled in the art and can be found in Current Protocols in 
Molecular Biology, John Wiley & Sons,N.Y. (1989), 6.3.1-6.3.6. An. 
non-limiting example of stringent hybridization conditions are hybridization in 
6X sodium chloride/sodium citrate (SSC) at about 45DC, followed by one or 
more washes in 0.2 X SSC, 0.1% SDS at 50-65DC (e.g., 50DC or 60DC or 

1 0 65 DC). Preferably, the isolated nucleic acid molecule of the invention that 

hybridizes under stringent conditions corresponds to a naturally-occurring nucleic 
acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule 
refers to an RNA or DNA molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

15 In addition to naturally-occurring allelic variants of the CARD-3, 

CARD-4, CARD-5, or CARD-6 sequence that may exist in the population, the 
skilled artisan will further appreciate that changes can be introduced by mutation 
into the nucleotide sequence of SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:7, 
SEQ ID NO:9, SEQ ID:25, SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40. 

2 0 SEQ ID NO:42, SEQ ID NO:48, SEQ ID NO:50. SEQ ID NO:5 1 . SEQ ID 
NO:53, SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:60, SEQ ID NO:62, the 
cDNA of ATCC 203037, the cDNA of ATCC 203035, the cDNA of ATCC 
203036, the cDNA of ATCC PTA-21 1. the cDNA of ATCC PTA-212. or the 
cDNA of ATCC PTA-21 3. thereby leading to changes in the amino acid 

2 5 sequence of the encoded protein without altering the functional ability of the 

protein. For example, one can make nucleotide substitutions leading to amino 
acid substitutions at "non-essential" amino acid residues. A "non-essential" 
amino acid residue is a residue that can be altered from the wild-type sequence of 
CARD-3, CARD-4 L/S. CARD-4 splice variant, murine CARD-4 protein, human 

3 0 CARD-5 protein, murine CARD-5 protein, or rat CARD-6 protein without 
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altering the biological activity, whereas an "essential" amino acid residue is 
required for biological activity. For example, amino acid residues that are 
conserved among the CARD-3, CARD-4L/S, CARD-4 splice variant, CARD-4, 
CARD-5, or CARD-6 proteins of various species are predicted to be particularly 
5 unamenable to alteration. 

For example, preferred CARD-3. CARD-4. CARD-5. and CARD-6 
proteins of the present invention, contain at least one CARD domain. 
Additionally, a CARD-3 protein also contains at least one kinase domain or at 
least one linker domain. A CARD domain contains at least one nucleotide 

1 0 binding domain or Leucine-rich repeats. Such conserved domains are less likely 
to be amenable to mutation. Other amino acid residues, however, (e.g., those that 
are not conserved or only semi-conserved among CARD-3. CARD-4. CARD-5, 
or CARD-6 of various species) may not be essential for activity and thus are 
likely to be amenable to alteration. 

1 5 Accordingly, another aspect of the invention pertains to nucleic acid 

molecules encoding CARD-3, CARD-4, CARD-5, or CARD-6 proteins that 
contain changes in amino acid residues that are not essential for activity. Such 
CARD-3. CARD-4, CARD-5. or CARD-6 proteins differ in amino acid sequence 
from SEQ ID NO:2. SEQ ID NO:8. SEQ ID NO:25. SEQ ID NO:39. SEQ ID 

2 0 NO:4 1 . SEQ ID NO:43. SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55. or 
SEQ ID NO:61, and yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein 
that includes an amino acid sequence that is at least about 45% identical, 65%, 
75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:2, 

2 5 SEQ ID NO:8. SEQ ID NO:26, SEQ ID NO:39, SEQ ID NO:41. SEQIDNO:43. 

SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55, or SEQ ID NO:61 . 

An isolated nucleic acid molecule encoding a CARD-3, CARD-4, 
CARD-5. or CARD-6 protein having a sequence which differs from that of SEQ 
ID NO: 1 . SEQ ID NO:3. SEQ ID NO:7, SEQ ID NO:9, SEQ ID:25. SEQ ID 

3 0 NO:27, SEQ ID NO:38. SEQ ID NO:40, SEQ ID NO:42. SEQ ID NO:48. SEQ 
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ID NO:50, SEQ ID NO:51, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:56, 
SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 203037, the cDNA of 
ATCC 203035, the cDNA of ATCC 203036, the cDNA of ATCC PTA-21 1, the 
cDNA of ATCC PTA-212, or the cDNA of ATCC PTA-213. can be created by 
5 introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of CARD-3 (SEQ ID NO:l, SEQ ID NO:3. the cDNA of 
ATCC 203037) or CARD-4L (SEQ ID NO:7, SEQ ID NO:9. the cDNA of ATCC 

203035) , or CARD -4 S (SEQ ID NO:25, SEQ ID NO:27, the cDNA of ATCC 

203036) , or human CARD-4 splice variants (SEQ ID NO:38, SEQ ID NO:40. or 
10 murine CARD-4 (SEQ ID NO:42), or murine CARD-5 (SEQ ID NO:60, SEQ ID 

NO:62. the cDNA of PTA-21 1 ), or human CARD-5 (SEQ ID NO:48, SEQ ID 
NO.-50. the cDNA of ATCC PTA-213), rat CARD-6 (SEQ ID NO:51, SEQ ID 
NO:53, the cDNA of ATCC PTA-21 1), or human CARD-6 (SEQ ID NO:54, 
SEQ ID NO:56, the cDNA of ATCC PTA-213) such that one or more amino acid 

1 5 substitutions, additions or deletions are introduced into the encoded protein. 
Mutations can be introduced by standard techniques, such as site-directed 
mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino 
acid substitutions are made at one or more predicted non-essential amino acid 
residues. Thus, for example, 1%. 2%, 3%, 5%. or 10% of the amino acids can be 

2 0 replaced by conservative substitution. A "conservative amino acid substitution" 
is one in which the amino acid residue is replaced with an amino acid residue 
having a similar side chain. Families of amino acid residues having similar side 
chains have been defined in the art. These families include amino acids with 
basic side chains (e.g., lysine, arginine. histidine), acidic side chains (e.g., 

2 5 aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, 

asparagine, glutamine. serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan), beta-branched side chains (e.g.. threonine, valine, isoleucine) and 
aromatic side chains (e.g.. tyrosine, phenylalanine, tryptophan, histidine). Thus. 

3 0 a predicted nonessential amino acid residue in CARD-3. CARD-4. CARD-5. or 
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CARD-6 is preferably replaced with another amino acid residue from the same 
side chain family. Alternatively, mutations can be introduced randomly along all 
or part of a CARD-3, CARD-4, CARD-5, or CARD-6 coding sequence, such as 
by saturation mutagenesis, and the resultant mutants can be screened for CARD- 
5 3, CARD-4, CARD-5, or CARD-6 biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantiy and the activity of the protein can be determined. 

In an embodiment, a mutant CARD-3. CARD-4, CARD-5. or CARD- 
6 protein can be assayed for: (1) the ability to form proteinrprotein interactions 

1 0 with proteins in the apoptotic signaling pathway; (2) the ability to bind a CARD- 
3, CARD-4, CARD-5, or CARD-6 ligand: or (3) the ability to bind to an 
intracellular target protein. For example, ( 1 ) in Example 7. a two-hybrid 
screening assay for the physical interaction of CARD-3 and CARD-4 is shown, 
(2) in Example 8, a two-hybrid system assay for the interaction between CARD-4 

15 and its ligand hNUDC is described, and (3) in Example 12, a 

coimmunoprecipitation assay for the interaction of CARD-3 with its ligand 
CARD-4 is shown. In yet another embodiment, a mutant CARD-3. CARD-4, 
CARD-5. or CARD-6 protein can be assayed for the ability to modulate cellular 
proliferation, cellular differentiation, or cellular death. For example, in Example 

2 0 10. assays for the regulation of cellular death (apoptosis) by CARD-3 or CARD-4 
are described. In yet another embodiment, a mutant CARD-3 or CARD-4 protein 
can be assayed for regulation of a cellular signal transduction pathway. For 
example, in Example 9, an assay for the regulation by CARD-4 of the NF-kB 
pathway is described. 

2 5 The present invention encompasses antisense nucleic acid molecules. 

i.e., molecules which are complementary to a sense nucleic acid encoding a 
protein, e.g.. complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. Accordingly, an antisense 
nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic 

3 0 acid can be complementary iO an entire CARD-3. CARD-4, CARD-5. or CARD- 
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6 coding strand, or to only a portion thereof, e.g.. all or pan of the protein coding 
region (or open reading frame). An antisense nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a nucleotide sequence 
encoding CARD-3, CARD-4, CARD-5, or CARD-6. The noncoding regions ("5' 
5 and 3' untranslated regions") are the 5' and 3' sequences which flank the coding 
region and are not translated into amino acids. Given the coding strand 
sequences encoding CARD-3, CARD-4, CARD-5, and CARD-6 disclosed 
herein, antisense nucleic acids of the invention can be designed according to the 
rules of Watson and Crick base pairing. The antisense nucleic acid molecule can 

10 be complementary to the entire coding region of CARD-3, CARD-4. CARD-5, or 
CARD-6L/S mRNA. but more preferably is an oligonucleotide which is antisense 
to only a portion of the coding or noncoding region of CARD-3. CARD-4, 
CARD-5, or CARD-6 mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation start site of CARD-3 

15 mRNA, e.g., an oligonucleotide having the sequence 

CCCTGGTACTTGCCCCTCCGGTAG (SEQ ID NO:34) or 
CCTGGTACTTGCCCCTCC (SEQ ID NO:35) or of the CARD-4L mRNA, e.g.. 
TCGTTAAGCCCTTGAAGACAGTG (SEQ ID NO:36) and 
TCGTTAGCCCTTGAAGACCAGTGAGTGTAG (SEQ ID NO:37) or of the 

2 0 human CARD-5 mRNA, e.g., TAGGACCTCGGTACCCGCGCGCGCG (SEQ 
ID NO:68) or CGCCGGCCCC TAGGACCTCGGTACC (SEQ ID NO:69). An 
antisense oligonucleotide can be, for example, about 5, 10. 15, 20, 25, 30, 35, 40. 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis and enzymatic ligation reactions using 

2 5 procedures known in the an. For example, an antisense nucleic acid (e.g., an 

antisense oligonucleotide) can be chemically synthesized using naturally 
occurring nucleotides or variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase the physical stability of the 
duplex formed between the antisense and sense nucleic acids, e.g., 

3 0 phosphorothioate derivatives and acridine substituted nucleotides can be used. 
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Examples of modified nucleotides which can be used to generate the antisense 
nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine. 5-carboxymethylaminomethyluraci], 
5 dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine, 2,2-dimethylguanine. 2-methyladenine. 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine. 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2- 
thiouracil, beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5- 

10 methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), 
wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil. 2- 
thiouracil. 4-thiouracil, 5 -methyl uracil, uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3aino-3-N-2-carboxypropyl) 
uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid 

1 5 can be produced biologically using an expression vector into which a nucleic acid 
has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of 
interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically 

2 0 administered to a subject or generated in situ such that they hybridize with or 
bind to cellular mRNA and/or genomic DNA encoding a CARD-3. CARD-4, 
CARD-5. or CARD-6 protein to thereby inhibit expression of the protein, e.g., by 
inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for 

2 5 example, in the case of an antisense nucleic acid molecule which binds to DNA 

duplexes, through specific interactions in the major groove of the double helix. 
An example of a route of administration of antisense nucleic acid molecules of 
the invention include direct injection at a tissue site. Alternatively, antisense 
nucleic acid molecules can be modified to target selected cells and then 

3 0 administered systemically. For example, for systemic administration, antisense 



-48- 



WO 01/00826 



PCT/US00/17691 



molecules can be modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies which bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered 
5 to cells using the vectors described herein. To achieve sufficient intracellular 
concentrations of the antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or 
pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an a- 

1 0 anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to 
the usual ft-units, the strands run parallel to each other (Gaultier et al. (1 987) 
Nucleic Acids. Res. 15:6625-6641). The antisense nucleic acid molecule can 
also comprise a 2'-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 

15 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS 
Lett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic 
RNA molecules with ribonuclease activity which are capable of cleaving a 
single-stranded nucleic acid, such as an mRNA, to which they have a 

2 0 complementary region. Thus, ribozymes (e.g.. hammerhead ribozymes 

(described in Haselhoff and Gerlach (1988) Nature 334:585-91)) can be used to 
catalytically cleave CARD-3, CARD-4, CARD-5, or CARD-6 mRNA transcripts 
to thereby inhibit translation of CARD-3. CARD-4, CARD-5, or CARD-6 
mRNA. A ribozyme having specificity for a CARD-3. CARD-4. CARD-5. or 

2 5 CARD-6-encoding nucleic acid can be designed based upon the nucleotide 

sequence of a CARD-3, CARD-4, CARD-5. or CARD-6 cDNA disclosed herein. 
For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed 
in which the nucleotide sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in a CARD-3, CARD-4, CARD-5. or CARD- 

3 0 6-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987.071 : and Cech et 

-49 - 

BNSDOCIO: <WO O10O826A2J_> 



WO 01/00826 



PCT/USO0/17691 



al. U.S. Patent No. 5,11 6.742. Alternatively, CARD-3, CARD-4, CARD-5. or 
CARD-6 mRNA can be used to select a catalytic RNA having a specific 
ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel and 
Szostak(1993) Science 261:141 1-1418. 
5 The invention also encompasses nucleic acid molecules which form 

triple helical structures. For example, CARD-3, CARD-4.. CARD-5, or CARD-6 
gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of the CARD-3, CARD-4. CARD-5, or 
CARD-6 (e.g., the CARD-3, CARD-4, CARD-5, or CARD-6 promoter and/or 

1 0 enhancers) to form triple helical structures that prevent transcription of the 
CARD-3. CARD-4, CARD-5, or CARD-6 gene in target cells. See generally, 
Helene (1991) Anticancer Drug Des. 6(6): 569-84; Helene (1992) Ann. N.Y. 
Acad. Sci. 660:27-36: and Maher (1992) Bioassays 14(12):807-15. 

In embodiments, the nucleic acid molecules of the invention can be 

1 5 modified at the base moiety, sugar moiety or phosphate backbone to improve, 
e.g., the stability, hybridization, or solubility of the molecule. For example, the 
deoxyribose phosphate backbone of the nucleic acids can be modified to generate 
peptide nucleic acids (see Hyrup et al. (1996) Bioorganic & Medicinal Chemistry 
4(l):5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to 

2 0 nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate 
backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under conditions of low ionic 
strength. The synthesis of PNA oligomers can be performed using standard solid 

2 5 phase peptide synthesis protocols as described in Hyrup et al. ( 1 996) supra; 

Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670-675. 

PNAs of CARD-3. CARD-4, CARD-5, or CARD-6 can be used for 
therapeutic and diagnostic applications. For example, PNAs can be used as 
antisense or antigene agents for sequence-specific modulation of gene expression 

3 0 by. e.g.. inducing transcription or translation arrest or inhibiting replication. 
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PNAs of CARD-3, CARD-4, CARD-5, or CARD-6 can also be used, e.g., in the 
analysis of single base pair mutations in a gene by, e.g., PNA directed PCR 
clamping; as artificial restriction enzymes when used in combination with other 
enzymes, e.g., SI nucleases (Hyrup (1996) supra; or as probes or primers for 
5 DNA sequence and hybridization (Hyrup (1996) supra; Perry-O'Keefe et al. 
(1996) Proc. Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs of CARD-3, CARD-4, CARD-5, or 
CARD-6 can be modified, e.g., to enhance their stability or cellular uptake, by 
attaching lipophilic or other helper groups to PNA. by the formation of PNA- 

10 DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras of CARD-3, CARD-4, 
CARD-5, or CARD-6 can be generated which may combine the advantageous 
properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, 
e.g., RNAse H and DNA polymerases, to interact with the DNA portion while the 

15 PNA portion would provide high binding affinity and specificity. PNA-DNA 
chimeras can be linked using linkers of appropriate lengths selected in terms of 
base stacking, number of bonds between the nucleobases, and orientation (Hyrup 
(1996) supra). The synthesis of PNA-DNA chimeras can be performed as 
described in Hyrup (1996) supra and Finn et al. (1996) Nucleic Acids Research 

20 24(17):3357-63. For example, a DNA chain can be synthesized on a solid 
support using standard phosphoramidite coupling chemistry and modified 
nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine 
phosphoramidite, can be used as a between the PNA and the 5' end of DNA (Mag 
et al. (1989) Nucleic Acid Res. 17:5973-88). PNA monomers are then coupled in 

2 5 a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al. (1996) Nucleic Acids Research 24( 1 7):3357-63). 
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and 
a 3' PNA segment (Peterser et al. (1975) Bioorganic Med. Chem. Lett. 5:1119- 
11124). 
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In other embodiments, the oligonucleotide may include other 
appended groups such as peptides (e.g., for targeting host cell receptors in vivo), 
or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al. 
(1989) Proc. Natl. Acad. Sci. USA 86:6553-6556; Lemaitre et al. (1987) Proc. 
5 Natl. Acad. Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or the 
blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134). In addition, 
oligonucleotides can be modified with hybridization-triggered cleavage agents 
(see. e.g.. Kiol et al. (1988) Bio/Techniques 6:958-976) or intercalating agents 
(see. e.g.. Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide 
1 0 may be conjugated to another molecule, e.g., a peptide, hybridization triggered 
cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc. 

II. Isolated CARD-3, CARD-4. CARD-5. and CARD-6 Proteins and Anti- 
CARD-3. CARD-4, CARD-5, and CARD-6 Antibodies. 

1 5 One aspect of the invention pertains to isolated CARD-3, CARD-4. 

CARD-5, and CARD-6 proteins, and biologically active portions thereof, as well 
as polypeptide fragments suitable for use as immunogens to raise anti-CARD-3, 
CARD-4. CARD-5, or CARD-6 antibodies. In one embodiment, native CARD- 
3, CARD-4. CARD-5, or CARD-6 proteins can be isolated from cells or tissue 

2 0 sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, CARD-3. CARD-4, CARD-5. or CARD-6 
proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, a CARD-3. CARD-4, CARD-5. or CARD-6 protein or 
polypeptide can be synthesized chemically using standard peptide synthesis 

2 5 techniques. 

An "isolated" or "purified" protein or biologically active portion 
thereof is substantially free of cellular material or other contaminating proteins 
from the cell or tissue source from which the CARD-3. CARD-4. CARD-5, or 
CARD-6 protein is derived, or substantially free from chemical precursors or 

3 0 other chemicals when chemically synthesized. The language "substantially free 
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of cellular material" includes preparations of CARD-3, CARD-4, CARD-5, or 
CARD-6 protein in which the protein is separated from cellular components of 
the cells from which it is isolated or recombinantly produced. Thus, CARD-3. 
CARD-4, CARD-5, or CARD-6 protein that is substantially free of cellular 
5 material includes preparations of CARD-3, CARD-4, CARD-5, or CARD-6 
protein having less than about 30%, 20%, 10%. or 5% (by dry weight) of non- 
CARD-3, CARD-4, CARD-5, or CARD-6 protein (also referred to herein as a 
"contaminating protein"). When the CARD-3, CARD-4. CARD-5, or CARD-6 
protein or biologically active portion thereof is recombinantly produced, it is also 

10 preferably substantially free of culture medium, i.e., culture medium represents 
less than about 20%, 10%, or 5% of the volume of the protein preparation. When 
CARD-3, CARD-4, CARD-5. or CARD-6 protein is produced by chemical 
synthesis, it is preferably substantially free of chemical precursors or other 
chemicals, i.e., it is separated from chemical precursors or other chemicals which 

15 are involved in the synthesis of the protein. Accordingly such preparations of 
CARD-3, CARD-4, CARD-5, or CARD-6 protein have less than about 30%. 
20%. 10%. 5% (by dry weight) of chemical precursors or non-CARD-3. CARD- 
4. CARD-5, or CARD-6 chemicals. 

Biologically active portions of a CARD-3. CARD-4. CARD-5. or 

2 0 CARD-6 protein include peptides comprising amino acid sequences sufficiently 
identical to or derived from the amino acid sequence of the CARD-3, CARD-4, 
CARD-5, or CARD-6 protein (e.g., the amino acid sequence shown in SEQ ID 
NO:2, SEQ ID NO:8, SEQ ID NO:26, SEQ ID NO:39, SEQ ID NO:41 . SEQ ID 
NO:43. SEQ ID NO:49. SEQ ID NO:52, SEQ ID NO:55. or SEQ ID NO:61 ), 

2 5 which include less amino acids than the full length CARD-3, CARD-4, CARD-5, 

or CARD-6 protein, and exhibit at least one activity of a CARD-3. CARD-4, 
CARD-5, or CARD-6 protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the CARD-3, CARD-4, CARD-5. or 
CARD-6 protein. A biologically active portion of a CARD-3, CARD-4. CARD- 

3 0 5. or CARD-6 protein can be a polypeptide which is. for example. 10. 25. 50. 100 
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or more amino acids in length. Preferred biologically active polypeptides include 
one or more identified CARD-3, CARD-4, CARD-5, or CARDS structural 
domains, e.g., the CARD domain (SEQ ID NO:6, SEQ ID NO: 10, SEQ ID 
NO:27, SEQ ID NO:66, SEQ ID NO:67, or SEQ ID NO:68). 
5 Moreover, other biologically active portions, in which other regions of 

the protein are deleted, can be prepared by recombinant techniques and evaluated 
for one or more of the functional activities of a native CARD-3, CARD-4, 
CARD-5, or CARD-6 protein. 

CARD-3. CARD-4. CARD-5, or CARD-6 protein has the amino acid 

1 0 sequence shown of SEQ ID NO:2. SEQ ID NO:8. SEQ ID NO:26. SEQ ID 

NO:39. SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:49. SEQ ID NO:52, SEQ 
ID NO:55, or SEQ ID NO:61. Other useful CARD-3, CARD-4, CARD-5, or 
CARD-6 proteins are substantially identical to SEQ ID NO:2, SEQ ID NO:8. 
SEQ ID NO:26, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID 

1 5 NO:49, SEQ ID NO:52, or SEQ ID NO:55, or SEQ ID NO:6 1 , and retain the 
functional activity of the protein of SEQ ID NO:2. SEQ ID NO:8. SEQ ID 
NO:26, SEQ ID NO:39. SEQ ID NO:41. SEQ ID NO:43. SEQ ID NO:49. SEQ 
ID NO:52, SEQ ID NO:55, or SEQ ID NO:6l, yet differ in amino acid sequence 
due to natural allelic variation or mutagenesis. CARD-3 and CARD-4 are 

2 0 involved in activating caspases in the apoptotic pathway. For example, in 
Example 10, CARD-4 is shown to enhance caspase 9 activity. 

A useful CARD-3, CARD-4. CARD-5. or CARD-6 protein is a 
protein which includes an amino acid sequence at least about 45%. preferably 
55%. 65%, 75%, 85%. 95%. or 99% identical to the amino acid sequence of SEQ 

2 5 ID NO:2. SEQ ID NO:8, SEQ ID NO:26, SEQ ID NO:39, SEQ ID NO:41. SEQ 

ID NO:43. SEQ ID NO:49, SEQ ID NO:52. SEQ ID NO:55, or SEQ ID NO:61 , 
and retains the functional activity of the CARD-3, CARD-4, CARD-5. or CARD- 
6 proteins of SEQ ID NO:2. SEQ ID NO:8. SEQ ID NO:26, SEQ ID NO:39. 
SEQ ID NO:41. SEQ IDNO:43. SEQ IDNO:49. SEQ ID NO:52. SEQ ID 

3 0 NO:55.orSEQIDNO:61. 
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To determine the percent identity of two amino acid sequences or of 
two nucleic acids, the sequences are aligned for optimal comparison purposes 
(e.g., gaps can be introduced in the sequence of a first amino acid or nucleic acid 
sequence for optimal alignment with a second amino or nucleic acid sequence). 
5 The amino acid residues or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position in the first sequence is 
occupied by the same amino acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are identical at that position. 
The percent identity between the two sequences is a function of the number of 

1 0 identical positions shared by the sequences (i.e., % identity = # of identical 
positions/total # of positions x 100). 

The determination of percent homology between two sequences can 
be accomplished using a mathematical algorithm. A preferred, non-limiting 
example of a mathematical algorithm utilized for the comparison of two 

1 5 sequences is the algorithm of Karlin and Altschul (1990) Proc. Nat'l Acad. Sci. 
USA 87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Nat'l Acad. 
Sci. USA 90:5873-5877. Such an algorithm is incorporated into the NBLAST 
and XBLAST programs of Altschul, et al. (1990) J. Mol. Biol. 215:403-410. 
BLAST nucleotide searches can be performed with the NBLAST program, score 

2 0 = 100. wordlength = 1 2 to obtain nucleotide sequences similar or homologous to 

CARD-3, CARD-4, CARD-5, or CARD-6 nucleic acid molecules of the 
invention. For example, Example 5 describes the use of the TBLASTN program 
to query a database of sequences of full length and partial cDNA sequences with 
the human CARD-4 polypeptide sequence leading to the discovery of murine 
25 CARD-4 and Example 4 describes the use of BLASTN to query a proprietary 
EST database with the 5' untranslated sequence of CARD-4 leading to the 
discovery of two human CARD-4 splice variants. BLAST protein searches can 
be performed with the XBLAST program, score = 50. wordlength = 3 to obtain 
amino acid sequences homologous to CARD-3. CARD-4. CARD-5. or CARD-6 

3 0 protein molecules of the invention. To obtain gapped alignments for comparison 

-55 - 

BNSDOCID: <WO 0100826A2_I_> 



WO 01/00826 



PCT/USOO/17691 



purposes. Gapped BLAST can be utilized as described in Altschul et al. (1 997) 
Nucleic Acids Res. 25:3389-3402. When utilizing BLAST and Gapped BLAST 
programs, the default parameters of the respective programs (e.g.. XBLAST and 
NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. Another preferred. 
5 non-limiting example of a mathematical algorithm utilized for the comparison of 
sequences is the algorithm of Myers and Miller. CABIOS (1989). Such an 
algorithm is incorporated into the ALIGN program (version 2.0) which is part of 
the GCG sequence alignment software package. When utilizing the ALIGN 
program for comparing amino acid sequences, a PAM120 weight residue table, a 
1 0 gap length penalty of 1 2. and a gap penalty of 4 can be used. 

The percent identity between two sequences can be determined using 
techniques similar to those described above, with or without allowing gaps. In 
calculating percent identity, typically exact matches are counted. 

The invention also provides CARD-3. CARD-4. CARD-5, or CARD- 
15 6 chimeric or fusion proteins. As used herein, a CARD-3. CARD-4. CARD-5. or 
CARD-6 "chimeric protein" or "fusion protein" comprises a CARD-3. CARD-4. 
CARD-5. or CARD-6 polypeptide operatively linked to a non-CARD-3. CARD- 
4. CARD-5. or CARD-6 polypeptide. A "CARD-3. CARD-4. CARD-5, or 
CARD-6 polypeptide" refers to a polypeptide having an amino acid sequence 
2 0 corresponding to all or a portion (preferably a biologically active portion) of a 
CARD-3, CARD-4. CARD-5. or CARD-6, whereas a "non-CARD-3. CARD-4. 
CARD-5, or CARD-6 polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a protein which is not substantially identical to the 
CARD-3. CARD-4. CARD-5. or CARD-6 protein, e.g., a protein which is 

2 5 different from the CARD-3, CARD-4. CARD-5, or CARD-6 proteins and which 

is derived from the same or a different organism. Within the fusion protein, the 
term "operatively linked" is intended to indicate that the CARD-3. CARD-4, 
CARD-5, or CARD-6 polypeptide and the non-CARD-3. CARD-4. CARD-5, or 
CARD-6 polypeptide are fused in-frame to each other. The heterologous 

3 0 polypeptide can be fused to the N-terminus or C-terminus of the CARD-3. 
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CARD-4, CARD-5, or CARD-6 polypeptide. 

One useful fusion protein is a GST fusion protein in which the 
CARD-3, CARD-4, CARD-5, or CARD-6 sequences are fused to the C-terminus 
of the GST sequences. Such fusion proteins can facilitate the purification of 
5 recombinant CARD-3, CARD-4, CARD-5. or CARD-6. In another embodiment, 
the fusion protein contains a signal sequence from another protein. In certain 
host cells (e.g., mammalian host cells), expression and/or secretion of CARD-3, 
CARD-4, CARD-5. or CARD-6 can be increased through use of a heterologous 
signal sequence. For example, the gp67 secretory sequence of the baculovirus 

1 0 envelope protein can be used as a heterologous signal sequence (Current 

Protocols in Molecular Biology, Ausubel et al., eds.. John Wiley & Sons, 1992). 
Other examples of eukaryotic heterologous signal sequences include the secretory 
sequences of melittin and human placental alkaline phosphatase (Stratagene; La 
Jolla, California). In yet another example, useful prokaryotic heterologous signal 

1 5 sequences include the phoA secretory signal (Molecular cloning, Sambrook et al, 
second edition, Cold spring harbor laboratory press, 1989) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway. New Jersey). 

In yet another embodiment, the fusion protein is a CARD-3, CARD-4. 
CARD-5, or CARD-6-immunoglobulin fusion protein in which all or part of 

2 0 CARD-3. CARD-4, CARD-5, or CARD-6 is fused to sequences derived from a 
member of the immunoglobulin protein family. The CARD-3, CARD-4, CARD- 
5. or CARD-6-immunoglobulin fusion proteins of the invention can be 
incorporated into pharmaceutical compositions and administered to a subject to 
inhibit an interaction between a CARD-3, CARD-4, CARD-5, or CARD-6 ligand 

2 5 and a CARD-3, CARD-4, CARD-5, or CARD-6 protein on the surface of a cell, 

to thereby suppress CARD-3, CARD-4. CARD-5. or CARD-6-mediated signal 
transduction in vivo. The CARD-3. CARD-4, CARD-5, or CARD- 
6-immunoglobulin fusion proteins can be used to affect the bioavailability of a 
CARD-3. CARD-4. CARD-5, or CARD-6 cognate ligand. Inhibition of the 

3 0 CARD-3 ligand/CARD-3, CARD-4 ligand/CARD-4. CARD-5 ligand/CARD-5. 
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or CARD-6 ligand/CARD-6 interaction may be useful therapeutically for both 
the treatment of proliferative and differentiative disorders, as well as modulating 
(e.g., promoting or inhibiting) cell survival. Moreover, the CARD-3, CARD-4, 
CARD-5, or CARD-6-immunoglobulin fusion proteins of the invention can be 
5 used as immunogens to produce anti-CARD-3, CARD-4, CARD-5. or CARD-6 
antibodies in a subject, to purify CARD-3, CARD-4. CARD-5. or CARD-6 
ligands and in screening assays to identify' molecules which inhibit the interaction 
of CARD-3. CARD-4. CARD-5. or CARD-6 with a CARD-3. CARD-4. CARD- 
5. or CARD-6 ligand. 

10 Preferably, a CARD-3. CARD-4, CARD-5, or CARD-6 chimeric or 

fusion protein of the invention is produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the different polypeptide 
sequences are ligated together in-frame in accordance with conventional 
techniques, for example by employing blunt-ended or stagger-ended termini for 

15 ligation, restriction enzyme digestion to provide for appropriate termini, filling-in 
of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. In another embodiment, the fusion 
gene can be synthesized by conventional techniques including automated DNA 
synthesizers. Alternatively. PCR amplification of gene fragments can be carried 

2 0 out using anchor primers which give rise to complementary overhangs between 

two consecutive gene fragments which can subsequently be annealed and 
reamplified to generate a chimeric gene sequence (see, e.g.. Current Protocols in 
Molecular Biology, Ausubel et al. eds.. John Wiley & Sons: 1992). Moreover, 
many expression vectors are commercially available that already encode a fusion 
25 moiety (e.g., a GST polypeptide). A CARD-3, CARD-4, CARD-5, or CARD-6- 
encoding nucleic acid can be cloned into such an expression vector such that the 
fusion moiety is linked in-frame to the CARD-3, CARD-4, CARD-5, or CARD-6 
protein. 

The present invention also pertains to variants of the CARD-3. 

3 0 CARD-4. CARD-5, or CARD-6 proteins which function as either CARD-3. 
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CARD-4, CARD-5, or CARD-6 agonists (mimetics) or as CARD-3. CARD-4, 
CARD-5. or CARD-6 antagonists. Variants of the CARD-3, CARD-4, CARD-5. 
or CARD-6 protein can be generated by mutagenesis, e.g., discrete point 
mutation or truncation of the CARD-3, CARD-4, CARD-5, or CARD-6 protein. 
5 An agonist of the CARD-3, CARD-4, CARD-5, or CARD-6 protein can retain 
substantially the same, or a subset, of the biological activities of the naturally 
occurring form of the CARD-3, CARD-4, CARD-5, or CARD-6 protein. An 
antagonist of the CARD-3, CARD-4, CARD-5, or CARD-6 protein can inhibit 
one or more of the activities of the naturally occurring form of the CARD-3, 

1 0 CARD-4. CARD-5. or CARD-6 protein by, for example, competitively binding 
to a downstream or upstream member of a cellular signaling cascade which 
includes the CARD-3, CARD-4. CARD-5, or CARD-6 protein. Thus, specific 
biological effects can be elicited by treatment with a variant of limited function. 
Treatment of a subject with a variant having a subset of the biological activities 

1 5 of the naturally occurring form of the protein can have fewer side effects in a 
subject relative to treatment with the naturally occurring form of the CARD-3, 
CARD-4. CARD-5, or CARD-6 proteins. 

Variants of the CARD-3, CARD-4, CARD-5, or CARD-6 protein 
which function as either CARD-3, CARD-4, CARD-5, or CARD-6 agonists 

2 0 (mimetics) or as CARD-3, CARD-4. CARD-5. or CARD-6 antagonists can be 
identified by screening combinatorial libraries of mutants, e.g.. truncation 
mutants of the CARD-3, CARD-4, CARD-5. or CARD-6 protein for CARD-3, 
CARD-4, CARD-5. or CARD-6 protein agonist or antagonist activity. In one 
embodiment, a variegated library of CARD-3, CARD-4, CARD-5, or CARD-6 

2 5 variants is generated by combinatorial mutagenesis at the nucleic acid level and is 

encoded by a variegated gene library. A variegated library of CARD-3. CARD- 
4, CARD-5, or CARD-6 variants can be produced by, for example, enzymatically 
ligating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential CARD-3, CARD-4. CARD-5. or CARD-6 sequences 

3 0 is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
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proteins (e.g., for phage display) containing the set of CARD-3, CARD-4, 
CARD-5. or CARD-6 sequences therein. There are a variety of methods which 
can be used to produce libraries of potential CARD-3, CARD-4, CARD-5, or 
CARD-6 variants from a degenerate oligonucleotide sequence. Chemical 
5 synthesis of a degenerate gene sequence can be performed in an automatic DNA 
synthesizer, and the synthetic gene then ligated into an appropriate expression 
vector. Use of a degenerate set of genes allows for the provision, in one mixture, 
of all of the sequences encoding the desired set of potential CARD-3, CARD-4, 
CARD-5. or CARD-6 sequences. Methods for synthesizing degenerate 

10 oligonucleotides are known in the art (see, e.g.. Narang (1983) Tetrahedron 39:3; 
Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 
198:1056; Ikeetal. (1983) Nucleic Acid Res. 11:477). 

Useful fragments of CARD-3. CARD-4. CARD-5. and CARD-6. 
include fragments comprising or consisting of a domain or subdomain described 

15 herein, e.g.. a kinase domain or a CARD domain. 

In addition, libraries of fragments of the CARD-3, CARD-4, CARD- 
5. or CARD-6 protein coding sequence can be used to generate a variegated 
population of CARD-3. CARD-4, CARD-5. or CARD-6 fragments for screening 
and subsequent selection of variants of a CARD-3. CARD-4, CARD-5. or 

20 CARD-6 protein. In one embodiment, a library of coding sequence fragments 
can be generated by treating a double stranded PCR fragment of a CARD-3. 
CARD-4, CARD-5, or CARD-6 coding sequence with a nuclease under 
conditions wherein nicking occurs only about once per molecule, denaturing the 
double stranded DNA, renaturing the DNA to form double stranded DNA which 

25 can include sense/antisense pairs from different nicked products, removing single 
stranded portions from reformed duplexes by treatment with S 1 nuclease, and 
ligating the resulting fragment library into an expression vector. By this method, 
an expression library can be derived which encodes N-terminal and internal 
fragments of various sizes of the CARD-3. CARD-4. CARD-5, or CARD-6 

3 0 protein. 
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Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening 
cDNA libraries for gene products having a selected property. Such techniques 
are adaptable for rapid screening of the gene libraries generated by the 
5 combinatorial mutagenesis of CARD-3, CARD-4, CARD-5. or CARD-6 
proteins. The most widely used techniques, which are amenable to high 
through-put analysis, for screening large gene libraries typically include cloning 
the gene library into replicable expression vectors, transforming appropriate cells 
with the resulting library of vectors, and expressing the combinatorial genes 

1 0 under conditions in which detection of a desired activity facilitates isolation of 
the vector encoding the gene whose product was detected. Recursive ensemble 
mutagenesis (REM), a technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with the screening assays to 
identify CARD-3. CARD-4, CARD-5, or CARD-6 variants (Arkin and Yourvan 

15 (1992) Proc. Natl. Acad. Sci. USA 89:781 1-7815; Delgrave et al. (1993) Protein 
Engineering 6(3):327-331). 

An isolated CARD-3, CARD-4, CARD-5, or CARD-6 protein, or a 
portion or fragment thereof, can be used as an immunogen to generate antibodies 
that bind CARD-3, CARD-4, CARD-5. or CARD-6 using standard techniques for 

2 0 polyclonal and monoclonal antibody preparation. The full-length CARD-3. 
CARD-4, CARD-5. or CARD-6 protein can be used or, alternatively, the 
invention provides antigenic peptide fragments of CARD-3, CARD-4, CARD-5, 
or CARD-6 for use as immunogens. The antigenic peptide of CARD-3, CARD-4. 
CARD-5, or CARD-6 comprises at least 8 (preferably 10, 15, 20, or 30) amino 

2 5 acid residues of the amino acid sequence shown in SEQ ID NO:2. SEQ ID NO:8. 

SEQ ID NO:26, SEQ ID NO:39, SEQ ID NO:41 . SEQ ID NO:43, SEQ ID 
NO:49. SEQ ID NO:52. SEQ ID NO:55, or SEQ ID NO:61 or polypeptides 
including amino acids 128-139 or 287-298 of human CARD-4L and encompasses 
an epitope of CARD-3. CARD-4. CARD-5. or CARD-6 such that an antibody 

3 0 raised against the peptide forms a specific immune complex with CARD-3. 
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CARD-4, CARD-5, or CARD-6. 

Useful antibodies include antibodies which bind to a domain or 
subdomain of CARD-3, CARD-4, CARD-5, or CARD-6 described herein (e.g., a 
kinase domain, a CARD domain, or a leucine-rich domain). 
5 Preferred epitopes encompassed by the antigenic peptide are regions 

of CARD-3, CARD-4, CARD-5, or CARD-6 that are located on the surface of 
the protein, e.g., hydrophilic regions. Other important criteria include a 
preference for a terminal sequence, high antigenic index (e.g., as predicted by 
Jameson- Wolf algorithm), ease of peptide synthesis (e.g., avoidance of prolines); 
10 and high surface probability (e.g.. as predicted by the Emini algorithm; Figure 8 
and Figure 9). 

A CARD-3, CARD-4, CARD-5. or CARD-6 immunogen typically is 
used to prepare antibodies by immunizing a suitable subject, (e.g., rabbit, goat, 
mouse or other mammal) with the immunogen. An appropriate immunogenic 

15 preparation can contain, for example, recombinantly expressed CARD-3, CARD- 
4. CARD-5, or CARD-6 protein or a chemically synthesized CARD-3, CARD-4. 
CARD-5, or CARD-6 polypeptide. The preparation can further include an 
adjuvant, such as Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable subject with an 

20 immunogenic CARD-3, CARD-4, CARD-5. or CARD-6 preparation induces a 
polyclonal anti-CARD-3, CARD-4, CARD-5. or CARD-6 antibody response. 
For example, polypeptides including amino acids 128-139 or 287-298 of human 
CARD-4L were conjugated to KLH and the resulting conjugates were used to 
immunize rabbits and polyclonal antibodies that specifically recognize the two 

2 5 immunogen peptides were generated. 

Accordingly, another aspect of the invention pertains to anti-CARD-3. 
CARD-4, CARD-5, or CARD-6 antibodies. The term "antibody" as used herein 
refers to immunoglobulin molecules and immunologically active portions of 
immunoglobulin molecules, i.e.. molecules that contain an antigen binding site 

3 0 which specifically binds an antigen, such as CARD-3, CARD-4. CARD-5, or 
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CARD-6. A molecule which specifically binds to CARD-3, CARD-4, CARD-5, 
or CARD-6 is a molecule which binds CARD-3, CARD-4. CARD-5. or CARD- 
6, but does not substantially bind other molecules in a sample, e.g., a biological 
sample, which naturally contains CARD-3, CARD-4. CARD-5, or CARD-6. 
5 Examples of immunologically active portions of immunoglobulin molecules 
include F(ab) and F(ab')2 fragments which can be generated by treating the 
antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind CARD-3, CARD-4, CARD-5, or CARD-6. The 
term "monoclonal antibody" or "monoclonal antibody composition", as used 

1 0 herein, refers to a population of antibody molecules that contain only one species 
of an antigen binding site capable of immunoreacting with a particular epitope of 
CARD-3, CARD-4, CARD-5, or CARD-6. A monoclonal antibody composition 
thus typically displays a single binding affinity for a particular CARD-3. CARD- 
4. CARD-5, or CARD-6 protein with which it immunoreacts. 

1 5 Polyclonal anti-CARD-3, CARD-4, CARD-5, or CARD-6 antibodies 

can be prepared as described above by immunizing a suitable subject with a 
CARD-3, CARD-4. CARD-5, or CARD-6 immunogen. The anti-CARD-3, 
CARD-4, CARD-5. or CARD-6 antibody titer in the immunized subject can be 
monitored over time by standard techniques, such as with an enzyme linked 

2 0 immunosorbent assay (ELISA) using immobilized CARD-3. CARD-4, CARD-5. 
or CARD-6. If desired, the antibody molecules directed against CARD-3, 
CARD-4, CARD-5, or CARD-6 can be isolated from the mammal (e.g.. from the 
blood) and further purified by well-known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an appropriate time after 

2 5 immunization, e.g., when the anti-CARD-3, CARD-4, CARD-5, or CARD-6 

antibody titers are highest, antibody-producing cells can be obtained from the 
subject and used to prepare monoclonal antibodies by standard techniques, such 
as the hybridoma technique originally described by Kohler and Milstein (1975) 
Nature 256:495-497, the human B cell hybridoma technique (Kozbor et al. (1983) 

3 0 Immunol Today 4:72), the EBV-hybridoma technique (Cole et al. (1985), 
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Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or 
trioma techniques. The technology for producing various antibodies monoclonal 
antibody hybridomas is well known (see generally Current Protocols in 
Immunology (1994) Coligan et al. (eds.) John Wiley & Sons, Inc.. New York. 
5 NY). Briefly, an immortal cell line (typically a myeloma) is fused to 

lymphocytes (typically splenocytes) from a mammal immunized with a CARD-3 . 
CARD-4, CARD-5. or CARD-6 immunogen as described above, and the culture 
supernatants of the resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds CARD-3. CARD-4. 

1 0 CARD-5. or CARD-6. 

Any of the many well known protocols used for fusing lymphocytes 
and immortalized cell lines can be applied for the purpose of generating an 
anti-CARD-3, CARD-4, CARD-5, or CARD-6 monoclonal antibody (see. e.g., 
Current Protocols in Immunology, supra; Galfre et al. (1977) Nature 266:55052; 

15 R.H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological 
Analyses, Plenum Publishing Corp., New York, New York (1 980); and Lerner 
(1981) Yale J. Biol. Med.. 54:387-402). Moreover, the ordinarily skilled worker 
will appreciate that there are many variations of such methods which also would 
be useful. Typically, the immortal cell line (e.g.. a myeloma cell line) is derived 

2 0 from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with 
an immunogenic preparation of the present invention with an immortalized 
mouse cell line, e.g., a myeloma cell line that is sensitive to culture medium 
containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a 

2 5 number of myeloma cell lines can be used as a fusion partner according to 

standard techniques, e.g.. the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 
myeloma lines. These myeloma lines are available from ATCC. Typically, 
HAT-sensitive mouse myeloma cells are fused to mouse splenocytes using 
polyethylene glycol ("PEG"). Hybridoma cells resulting from the fusion are then 

3 0 selected using HAT medium, which kills unfused and unproductively fused 
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myeloma cells (unfused splenocytes die after several days because they are not 
transformed). Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture supernatants for 
antibodies that bind CARD-3, CARD-4, CARD-5. or CARD-6, e.g.. using a 
5 standard ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal anti-CARD-3, CARD-4, CARD-5, or CARD-6 antibody can be 
identified and isolated by screening a recombinant combinatorial 
immunoglobulin library (e.g., an antibody phage display library) with CARD-3, 

10 CARD-4, CARD-5. or CARD-6 to thereby isolate immunoglobulin library 
members that bind CARD-3. CARD-4. CARD-5. or CARD-6. Kits for 
generating and screening phage display libraries are commercially available (e.g.. 
the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01; 
and the Stratagene SurfZAP Phage Display Kit, Catalog No. 240612). 

15 Additionally, examples of methods and reagents particularly amenable for use in 
generating and screening antibody display library can be found in, for example, 
U.S. Patent No. 5,223.409; PCT Publication No. WO 92/18619; PCT Publication 
No. WO 91/17271; PCT Publication No. WO 92/20791; PCT Publication No. 
WO 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 

2 0 92/0 1 047; PCT Publication No. WO 92/09690; PCT Publication No. WO 
90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) 
Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281: 
Griffiths et al. (1993) EMBO J. 12:725-734. 

Additionally, recombinant anti-CARD-3, CARD-4, CARD-5. or 

2 5 CARD-6 antibodies, such as chimeric and humanized monoclonal antibodies. 

comprising both human and non-human portions, which can be made using 
standard recombinant DNA techniques, are within the scope of the invention. 
Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art. for example using methods 

3 0 described in PCT Publication No. WO 87/02671 ; European Patent Application 
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1 84,1 87; European Patent Application 171 ,496; European Patent Application 
173,494; PCT Publication No. WO 86/01533: U.S. Patent No. 4,816,567; 
European Patent Application 125,023; Better etal. (1988) Science 240:1041- 
1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443; Liu et al. 
5 (1987) J. Immunol. 139:3521-3526; Sun et al. (1987) Proc. Natl. Acad. Sci. USA 
84:214-218; Nishimura et al. (1987) Cane. Res. 47:999-1005; Wood et al. (1985) 
Nature 314:446-449; and Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559); 
Morrison, (1985) Science 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214; 
U.S. Patent 5.225.539; Jones et al. (1986) Nature 321:552-525; Verhoeyan et al. 

10 ( 1988) Science 239: 1534; and Beidler etal. (1988) J. Immunol. 141:4053-4060. 

An anti-CARD-3, CARD-4, CARD-5. or CARD-6 antibody (e.g., 
monoclonal antibody) can be used to isolate CARD-3. CARD-4. CARD-5, or 
CARD-6 by standard techniques, such as affinity chromatography or 
immunoprecipitation. An anti-CARD-3, CARD-4, CARD-5. or CARD-6 

15 antibody can facilitate the purification of natural CARD-3, CARD-4, CARD-5, or 
CARD-6 from cells and of recombinantly produced CARD-3, CARD-4, CARD- 
5, or CARD-6 expressed in host cells. Moreover, an anti-CARD-3. CARD-4. 
CARD-5. or CARD-6 antibody can be used to detect CARD-3, CARD-4. CARD- 
5. or CARD-6 protein (e.g.. in a cellular lysate or cell supernatant) in order to 

2 0 evaluate the abundance and pattern of expression of the CARD-3, CARD-4, 
CARD-5, or CARD-6 protein. Anti-CARD-3, CARD-4, CARD-5, or CARD-6 
antibodies can be used diagnostically to monitor protein levels in tissue as part of 
a clinical testing procedure, e.g., to. for example, determine the efficacy of a 
given treatment regimen. Detection can be facilitated by coupling the antibody to 

2 5 a detectable substance. Examples of detectable substances include various 

enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase. B-galactosidase. 
or acetylcholinesterase; examples of suitable prosthetic group complexes include 

3 0 streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
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include umbelliferone, fluorescein, fluorescein isothiocyanate. rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol; examples of bioluminescent materials 
include luciferase, luciferin, and aequorin, and examples of suitable radioactive 
5 material include 125 I, 13, 1, 35 S or 3 H. 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably 
expression vectors, containing a nucleic acid encoding CARD-3, CARD-4, 

1 0 CARD-5, or CARD-6 (or a portion thereof). As used herein, the term "vector" 
refers to a nucleic acid molecule capable of transporting another nucleic acid to 
which it has been linked. One type of vector is a "plasmid", which refers to a 
circular double stranded DNA loop into which additional DNA segments can be 
ligated. Another type of vector is a viral vector, wherein additional DNA 

15 segments can be ligated into the viral genome. Certain vectors are capable of 
autonomous replication in a host cell into which they are introduced (e.g., 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated 
into the genome of a host cell upon introduction into the host cell, and thereby are 

2 0 replicated along with the host genome. Moreover, certain vectors, expression 
vectors, are capable of directing the expression of genes to which they are 
operatively linked. In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids (vectors). However, the invention is 
intended to include such other forms of expression vectors, such as viral vectors 

2 5 (e.g.. replication defective retroviruses, adenoviruses and adeno-associated 

viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a 
nucleic acid of the invention in a form suitable for expression of the nucleic acid 
in a host cell, which means that the recombinant expression vectors include one 

3 0 or more regulatory sequences, selected on the basis of the host cells to be used for 
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expression, which is operatively linked to the nucleic acid sequence to be 
expressed. Within a recombinant expression vector, "operably linked" is 
intended to mean that the nucleotide sequence of interest is linked to the 
regulatory sequence(s) in a manner which allows for expression of the nucleotide 
5 sequence (e.g., in an in vitro transcription/translation system or in a host cell 
when the vector is introduced into the host cell). The term "regulatory sequence" 
is intended to include promoters, enhancers and other expression control elements 
(e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 

10 185, Academic Press, San Diego, CA (1990). Regulatory sequences include 

those which direct constitutive expression of a nucleotide sequence in many types 
of host cell and those which direct expression of the nucleotide sequence only in 
certain host cells (e.g., tissue-specific regulatory sequences). It will be 
appreciated by those skilled in the art that the design of the expression vector can 

1 5 depend on such factors as the choice of the host cell to be transformed, the level 
of expression of protein desired, etc. The expression vectors of the invention can 
be introduced into host cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g.. 
CARD-3. CARD-4. CARD-5, or CARD-6 proteins, mutant forms of CARD-3. 

2 0 CARD-4. CARD-5, or CARD-6. fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed 
for expression of CARD-3, CARD-4, CARD-5, or CARD-6 in prokaryotic or 
eukaryotic cells, e.g., bacterial cells such as E. coli, insect cells (using 
baculovirus expression vectors) yeast cells or mammalian cells. Suitable host 

2 5 cells are discussed further in Goeddel, Gene Expression Technology: Methods in 

Enzymology 185, Academic Press. San Diego. CA (1990). Alternatively, the 
recombinant expression vector can be transcribed and translated in vitro, for 
example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. 

3 0 coli with vectors containing constitutive or inducible promoters directing the 
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expression of either fusion or non-fusion proteins. Fusion vectors add a number 
of amino acids to a protein encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve three purposes: 1 ) to 
increase expression of recombinant protein; 2) to increase the solubility of the 
5 recombinant protein; and 3) to aid in the purification of the recombinant protein 
by acting as a ligand in affinity purification. Often, in fusion expression vectors, 
a proteolytic cleavage site is introduced at the junction of the fusion moiety and 
the recombinant protein to enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion protein. Such enzymes, 

1 0 and their cognate recognition sequences, include Factor Xa, thrombin and 
enterokinase. Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), pMAL (New England 
Biolabs, Beverly, MA) and pRIT5 (Pharmacia. Piscataway, NJ) which fuse 
glutathione S-transferase (GST), maltose E binding protein, or protein A, 

1 5 respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors 
include pTrc (Amann et al., (1988) Gene 69:301-315) andpET 1 1 d (Studier et 
al.. Gene Expression Technology: Methods in Enzymology ] 85, Academic Press. 
San Diego, California (1990) 60-89). Target gene expression from the pTrc 

2 0 vector relies on host RNA polymerase transcription from a hybrid trp-lac fusion 
promoter. Target gene expression from the pET 1 Id vector relies on transcription 
from a T7 gnlO-lac fusion promoter mediated by a coexpressed viral RNA 
polymerase (T7 gnl). This viral polymerase is supplied by host strains 
BL21(DE3) or HMS174(DE3) from a resident X prophage harboring a T7 gnl 

2 5 gene under the transcriptional control of the lacUV5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is 
to express the protein in a host bacteria with an impaired capacity to 
proteolytically cleave the recombinant protein (Gottesman. Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, 

3 0 California (1990) 1 19-128). Another strategy is to alter the nucleic acid sequence 
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of the nucleic acid to be inserted into an expression vector so that the individual 
codons for each amino acid are those preferentially utilized in E. coli (Wada et al. 
(1992) Nucleic Acids Res. 20:21 1 1-2118). Such alteration of nucleic acid 
sequences of the invention can be carried out by standard DNA synthesis 
5 techniques. 

In another embodiment, the CARD-3, CARD-4, CARD-5. or CARD- 
6 expression vector is a yeast expression vector. Examples of vectors for 
expression in yeast S. cerivisae include pYepSecl (Baldari et al. (1987) EMBO J. 
6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 

10 (Schultz et al. (1987) Gene 54:1 13-123), pYES2 (Invitrogen Corporation. San 
Diego, CA), pGBT9 (Clontech, Palo Alto, CA), pGADIO (Clontech. Palo Alto. 
CA), pYADE4 and pYGAE2 and pYPGE2 (Brunelli and Pall, (1993) Yeast 
9:1299-1308), pYPGEl 5 (Brunelli and Pall, (1993) Yeast 9:1309-1318), pACTIl 
(Dr. S.E. Elledge, Baylor College of Medicine), and picZ (InVitrogen Corp, San 

1 5 Diego, CA). For example, in Example 7 the expression of a fusion protein 
comprising amino acids 1-145 of human CARD-4L fused to the DNA-binding 
domain of S. cerevisiae transcription factor GAL4 from the yeast expression 
vector pGBT9 is described. In another example, Example 8 describes the 
expression of a fusion protein comprising amino acids 406-953 of human C ARD- 

2 0 4L fused to the DNA-binding domain of S. cerevisiae transcription factor GAL4 
from the yeast expression vector pGBT9. In yet another example. Example 7 
describes the expression of a fusion protein comprising CARD-3 fused to the 
transcriptional activation domain of S. cerevisiae transcription factor GAL4 from 
the yeast expression vector pACTII. 

2 5 Alternatively, CARD-3, CARD-4, CARD-5, or CARD-6 can be 

expressed in insect cells using baculovirus expression vectors. Baculovirus 
vectors available for expression of proteins in cultured insect cells (e.g.. Sf 9 
cells) include the pAc series (Smith et al. (1983) Mol. Cell Biol. 3:2156-2165) 
and the pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

3 0 In yet another embodiment, a nucleic acid of the invention is 
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expressed in mammalian cells using a mammalian expression vector. Examples 
of mammalian expression vectors include pCDM8 (Seed (1987) Nature 329:840). 
pCI (Promega), and pMT2PC (Kaufman et al. (1987) EMBO J. 6:187-195). 
When used in mammalian cells, the expression vector's control functions are 
5 often provided by viral regulatory elements. For example, commonly used 

promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian 
Virus 40. For other suitable expression systems for both prokaryotic and 
eukaryotic cells see chapters 16 and 17 of Sambrook et al. (supra). For example. 
Example 9, Example 10, and Example 12 describe the expression of human 
1 0 CARD-4 or fragments thereof, CARD-3, or both from the mammalian expression 
vector pCI. 

In another embodiment, the recombinant mammalian expression 
vector is capable of directing expression of the nucleic acid preferentially in a 
particular cell type (e.g.. tissue-specific regulatory elements are used to express 

15 the nucleic acid). Tissue-specific regulatory elements are known in the art. Non- 
limiting examples of suitable tissue-specific promoters include the albumin 
promoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1:268-277), lymphoid- 
specific promoters (Calame and Eaton (1988) Adv. Immunol. 43:235-275), in 
particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J. 

2 0 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 33:729-740; Queen 
and. Baltimore (1983) Cell 33:741-748), neuron-specific promoters (e.g., the 
neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA 
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 
230:912-936), and mammary gland-specific promoters (e.g., milk whey 

2 5 promoter; U.S. Patent No. 4,873,3 1 6 and European Application Publication No. 
264.166). Developmentally-regulated promoters are also encompassed, for 
example the murine hox promoters (Kessel and Gruss (1990) Science 249:374- 
379) and the cc-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 
3:537-546). 
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The invention further provides a recombinant expression vector 
comprising a DNA molecule of the invention cloned into the expression vector in 
an antisense orientation. That is, the DNA molecule is operatively linked to a 
regulatory sequence in a manner which allows for expression (by transcription of 
5 the DNA molecule) of an RNA molecule which is antisense to CARD-3, CARD- 
4, CARD-5, or CARD-6 mRNA. Regulatory sequences operatively linked to a 
nucleic acid cloned in the antisense orientation can be chosen which direct the 
continuous expression of the antisense RNA molecule in a variety of cell types, 
for instance viral promoters and/or enhancers, or regulatory sequences can be 

1 0 chosen which direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in which antisense nucleic 
acids are produced under the control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into which the vector is 

1 5 introduced. For a discussion of the regulation of gene expression using antisense 
genes see Weintraub et al. (Reviews - Trends in Genetics, Vol. 1(1) 1986). 

Another aspect of the invention pertains to host cells into which a 
recombinant expression vector of the invention or isolated nucleic acid molecule 
of the invention has been introduced. The terms "host cell" and "recombinant 

2 0 host cell" are used interchangeably herein. It is understood that such terms refer 
not only to the particular subject cell but to the progeny or potential progeny of 
such a cell. Because certain modifications may occur in succeeding generations 
due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the 

2 5 term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, 
CARD-3, CARD-4, CARD-5, or CARD-6 protein can be expressed in bacterial 
cells such as E. coli. insect cells, yeast or mammalian cells (such as Chinese 
hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to 

3 0 those skilled in the art. For example, in Example 7 a Saccharomyces cerevisiae 
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host cell for recombinant CARD-4 and CARD-3 expression is described, and in 
Examples 9, 10, and 12, a 293T host cells for expression of CARD-4 or 
fragments thereof or CARD-3 are described. 

Vector DNA or an isolated nucleic acid molecule of the invention can 
5 be introduced into prokaryotic or eukaryotic cells via conventional transformation 
or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for 
introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 

1 0 transfection. lipofection, or electroporation. Suitable methods for transforming or 
iransfecting host cells can be found in Sambrook, et al. (supra), and other 
laboratory manuals. 

For stable transfection of mammalian cells, it is known that, 
depending upon the expression vector and transfection technique used, only a 

1 5 small fraction of cells may integrate the foreign DNA into their genome. In some 
cases vector DNA is retained by the host cell. In other cases the host cell does 
not retain vector DNA and retains only an isolated nucleic acid molecule of the 
invention carried by the vector. In some cases, and isolated nucleic acid 
molecule of the invention is used to transform a cell without the use of a vector. 

2 0 In order to identify and select these integrants, a gene that encodes a 

selectable marker (e.g., resistance to antibiotics) is generally introduced into the 
host cells along with the gene of interest. Preferred selectable markers include 
those which confer resistance to drugs, such as G418, hygromycin and 
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a 

25 host cell on the same vector as that encoding CARD-3. CARD-4, CARD-5, or 
CARD-6 or can be introduced on a separate vector. Cells stably transfected with 
the introduced nucleic acid can be identified by drug selection (e.g., cells that 
have incorporated the selectable marker gene will survive, while the other cells 
die). 

30 
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A host cell of the invention, such as a prokaryotic or eukaryotic host 
cell in culture, can be used to produce (i.e., express) a CARD-3. CARD-4, 
CARD-5, or CARD-6 protein. Accordingly, the invention further provides 
methods for producing CARD-3. CARD-4, CARD-5, or CARD-6 protein using 
5 the host cells of the invention. In one embodiment, the method comprises 
culturing the host cell of the invention (into which a recombinant expression 
vector or isolated nucleic acid molecule encoding CARD-3, CARD-4, CARD-5. 
or CARD-6 has been introduced) in a suitable medium such that CARD-3, 
CARD-4, CARD-5, or CARD-6 protein is produced. In another embodiment, the 

1 0 method further comprises isolating CARD-3, CARD-4, CARD-5. or CARD-6 
from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman 
transgenic animals. For example, in one embodiment, a host cell of the invention 
is a fertilized oocyte or an embryonic stem cell into which CARD-3, CARD-4, 

15 CARD-5, or CARD-6-coding sequences have been introduced. Such host cells 
can then be used to create non-human transgenic animals in which exogenous 
CARD-3, CARD-4. CARD-5. or CARD-6 sequences have been introduced into 
their genome or homologous recombinant animals in which endogenous CARD- 
3. CARD-4. CARD-5. or CARD-6 sequences have been altered. Such animals 

2 0 are useful for studying the function and/or activity of CARD-3, CARD-4. CARD- 
5. or CARD-6 and for identifying and/or evaluating modulators of CARD-3. 
CARD-4, CARD-5, or CARD-6 activity. As used herein, a "transgenic animal" 
is a non-human animal, preferably a mammal, more preferably a rodent such as a 
rat or mouse, in which one or more of the cells of the animal includes a 

2 5 transgene. Other examples of transgenic animals include non-human primates, 

sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic 
animal develops and which remains in the genome of the mature animal, thereby 
directing the expression of an encoded gene product in one or more cell types or 

3 0 tissues of the transgenic animal. As used herein, an "homologous recombinant 
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animal" is a non-human animal, preferably a mammal, more preferably a mouse, 
in which an endogenous CARD-3, CARD-4, CARD-5, or CARD-6 gene has 
been altered by homologous recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 
5 cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing 
CARD-3, CARD-4, CARD-5, or CARD-6-encoding nucleic acid into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and 
allowing the oocyte to develop in a pseudopregnant female foster animal. The 

10 CARD-3, CARD-4. CARD-5, or CARD-6 cDNA sequence, e.g.. that of SEQ ID 
NO:l, SEQ ID NO:3. SEQ ID NO:7, SEQ ID NO:9, SEQ ID:25, SEQ ID NO:27. 
SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:48, SEQ ID 
NO:50, SEQ ID NO:51, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:56. SEQ 
ID NO:60. SEQ ID NO:62, or the cDNA of ATCC 203037. or the cDNA of 

15 ATCC 203035, or the cDNA of ATCC 203036, or the cDNA of ATCC PTA-21 1, 
the cDNA of ATCC PTA-212, or the cDNA of ATCC PTA-21 3) can be 
introduced as a transgene into the genome of a non-human animal. Alternatively, 
a nonhuman homolog or ortholog of the human CARD-3. CARD-4, CARD-5, or 
CARD-6 gene, such as a mouse CARD-3, CARD-4, CARD-5, or CARD-6 gene. 

2 0 can be isolated based on hybridization to the human CARD-3, CARD-4, CARD- 
5. or CARD-6 cDNA and used as a transgene. For example, the mouse ortholog 
of CARD-4, Figure 15 and SEQ ID NO:42 can be used to make a transgenic 
animal using standard methods. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the efficiency of expression of 

2 5 the transgene. A tissue-specific regulatory sequence(s) can be operably linked to 
the CARD-3, CARD-4, CARD-5, or CARD-6 transgene to direct expression of 
CARD-3. CARD-4, CARD-5. or CARD-6 protein to particular cells. Methods 
for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are 

30 described, for example, in U.S. Patent Nos. 4,736,866 and 4,870.009, U.S. Patent 
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No. 4.873,191 and in Hogan, Manipulating the Mouse Embryo, (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar methods are 
used for production of other transgenic animals. A transgenic founder animal can 
be identified based upon the presence of the CARD-3, CARD-4, CARD-5, or 
5 CARD-6 transgene in its genome and/or expression of CARD-3. CARD-4. 
CARD-5, or CARD-6 mRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the 
transgene. Moreover, transgenic animals carrying a transgene encoding CARD- 
3. CARD-4, CARD-5, or CARD-6 can further be bred to other transgenic animals 

1 0 carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared 
which contains at least a portion of a CARD-3, CARD-4, CARD-5, or CARD-6 
gene (e.g., a human or a non-human homolog of the CARD-3, CARD-4, CARD- 
5. or CARD-6 gene, e.g., a murine CARD-3, CARD-4, CARD-5, or CARD-6 

1 5 gene) into which a deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the CARD-3, CARD-4. CARD-5, or 
CARD-6 gene. In an embodiment, the vector is designed such that, upon 
homologous recombination, the endogenous CARD-3, CARD-4, CARD-5. or 
CARD-6 gene is functionally disrupted (i.e.. no longer encodes a functional 

2 0 protein; also referred to as a "knock out" vector). Alternatively, the vector can be 

designed such that, upon homologous recombination, the endogenous CARD-3, 
CARD-4, CARD-5, or CARD-6 gene is mutated or otherwise altered but still 
encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous CARD-3, CARD-4, CARD-5, or 
25 CARD-6 protein). In the homologous recombination vector, the altered portion of 
the CARD-3, CARD-4, CARD-5, or CARD-6 gene is flanked at its 5' and 3' ends 
by additional nucleic acid of the CARD-3, CARD-4, CARD-5. or CARD-6 gene 
to allow for homologous recombination to occur between the exogenous CARD- 
3, CARD-4. CARD-5, or CARD-6 gene carried by the vector and an endogenous 

3 0 CARD-3. CARD-4, CARD-5, or CARD-6 gene in an embryonic stem cell. The 
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additional flanking CARD-3, CARD-4, CARD-5, or CARD-6 nucleic acid is of 
sufficient length for successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both at the 5' and 3' ends) 
are included in the vector (see, e.g., Thomas and Capecchi (1987) Cell 51 :503 for 
5 a description of homologous recombination vectors). The vector is introduced 
into an embryonic stem cell line (e.g., by electroporation) and cells in which the 
introduced CARD-3, CARD-4, CARD-5, or CARD-6 gene has homologously 
recombined with the endogenous CARD-3, CARD-4, CARD-5, or CARD-6 gene 
are selected (see, e.g., Li et al. (1992) Cell 69:915). The selected cells are then 

10 injected into a blastocyst of an animal (e.g., a mouse) to form aggregation 

chimeras (see, e.g., Bradley in Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, ed. (IRL. Oxford, 1987) pp. 1 13-152). A 
chimeric embryo can then be implanted into a suitable pseudopregnant female 
foster animal and the embryo brought to term. Progeny harboring the 

15 homologously recombined DNA in their germ cells can be used to breed animals 
in which all cells of the animal contain the homologously recombined DNA by 
germline transmission of the transgene. Methods for constructing homologous 
recombination vectors and homologous recombinant animals are described 
further in Bradley (1991) Current Opinion in Bio/Technology 2:823-829 and in 

2 0 PCT Publication Nos. WO 90/1 1354, WO 91/01 140, WO 92/0968. and WO 
93/04169. 

In another embodiment, transgenic non-humans animals can be 
produced which contain selected systems which allow for regulated expression of 
the transgene. One example of such a system is the cre/loxP recombinase system 

2 5 of bacteriophage PI . For a description of the cre/loxP recombinase system, see. 

e.g.. Lakso et al. (1992) Proc. Natl. Acad. Sci. USA 89:6232-6236. Another 
example of a recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251 : 135 1-1 355. If a 
cre/loxP recombinase system is used to regulate expression of the transgene. 

3 0 animals containing transgenes encoding both the Cre recombinase and a selected 
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protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one 
containing a transgene encoding a selected protein and the other containing a 
transgene encoding a recombinase. 
5 Clones of the non-human transgenic animals described herein can also 

be produced according to the methods described in Wilmut et al. (1997) Nature 
385:810-813 and PCT Publication Nos. WO 97/07668 and WO 97/07669. In 
brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter Go phase. The quiescent cell can then 

10 be fused, e.g., through the use of electrical pulses, to an enucleated oocyte from 
an animal of the same species from which the quiescent cell is isolated. The 
reconstructed oocyte is then cultured such that it develops to morula or blastocyte 
and then transferred to pseudopregnant female foster animal. The offspring 
borne of this female foster animal will be a clone of the animal from which the 

15 cell, e.g., the somatic cell, is isolated. 

IV. Pharmaceutical Compositions 

The CARD-3. CARD-4, CARD-5, or CARD-6 nucleic acid 
molecules, CARD-3, CARD-4. CARD-5, or CARD-6 proteins, and anti-CARD- 
2 0 3, CARD-4, CARD-5, or CARD-6 antibodies (also referred to herein as "active 
compounds") of the invention can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise 
the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable 
carrier. As used herein the language "pharmaceutically acceptable carrier" is 

2 5 intended to include any and all solvents, dispersion media, coatings, antibacterial 

and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except 
insofar as any conventional media or agent is incompatible with the active 

3 0 compound, use thereof in the compositions is contemplated. Supplementary 
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active compounds can also be incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical 
compositions for modulating the expression or activity of a polypeptide or 
nucleic acid of the invention. Such methods comprise formulating a 
5 pharmaceutically acceptable carrier with an agent which modulates expression or 
activity of a polypeptide or nucleic acid of the invention. Such compositions can 
further include additional active agents. Thus, the invention further includes 
methods for preparing a pharmaceutical composition by formulating a 
pharmaceutically acceptable carrier with an agent which modulates expression or 

1 0 activity of a polypeptide or nucleic acid of the invention and one or more 
addtional active compounds. 

The agent which modulates expression or activity may, for example, 
be a small molecule. For example, such small molecules include peptides, 
peptidomimetics, amino acids, amino acid analogs, polynucleotides, 

1 5 polynucleotide analogs, nucleotides, nucleotide analogs, organic or inorganic 

compounds (i.e., including heteroorganic and organometallic compounds) having 
a molecular weight less than about 10,000 grams per mole, organic or inorganic 
compounds having a molecular weight les than about 5,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 1 .000 

2 0 grams per mole, organic or inorganic compounds having a molecular weight less 
than about 500 grams per mole, and salts, esters, and other pharmaceutically 
acceptable forms of such compounds. It is understood that appropriate doses of 
small molecule agents depends upon a number of factors within the ken of the 
ordinarily skilled physician, veterinarian, or researcher. The dose(s) of the small 

25 molecule will vary, for example, depending upon the identity, size, and condition 
of the subject or sample being treated, further depending upon the route by which 
the composition is to be administered, if applicable, and the effect which the 
practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. Exemplary doses include milligram or microgram 

30 amounts of the small molecule per kilogram of subject or sample weight (e.g., 
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about 1 microgram per kilogram to about 500 milligrams per kilogram, about 100 
micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram. It is furthermore 
understood that appropriate doses of a small molecule depend upon the potency 
5 of the small molecule with respect to the expression or activity to modulated. 
Such appropriate doses may be determined using the assays described herein. 
When one or more of these small molecules is to be administered to an animal 
(e.g.. a human) in order to modulate expression or activity of a polypeptide or 
nucleic acid of the invention, a physician, veterinarian, or researcher may, for 

1 0 example, prescribe a relatively low dose at first, subsequently increasing the dose 
until an appropriate response is obtained. In addition, it is understood that the 
specific dose level for any particular subject will depend upon a variety of factors 
including the activity of the specific compound employed, the age, body weight, 
general health, gender, and diet of the subject, the time of administration, the 

15 route of administration, the rate of excretion, any drug combination, and the 
degree of expression or activity to be modulated. 

A pharmaceutical composition of the invention is formulated to be 
compatible with its intended route of administration. Examples of routes of 
administration include parenteral, e.g.. intravenous, intradermal, subcutaneous. 

2 0 oral (e.g.. inhalation), transdermal (topical), transmucosal, and rectal 

administration. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include the following components: a sterile diluent 
such as water for injection, saline solution, fixed oils, polyethylene glycols, 
glycerine, propylene glycol or other synthetic solvents; antibacterial agents such 

2 5 as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 

sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; 
buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or 
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation 

3 0 can be enclosed in ampoules, disposable syringes or multiple dose vials made of 
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glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile 
aqueous solutions (where water soluble) or dispersions and sterile powders for 
the extemporaneous preparation of sterile injectable solutions or dispersion. For 
5 intravenous administration, suitable carriers include physiological saline, 
bacteriostatic water. Cremophor EL? (BASF; Parsippany, NJ) or phosphate 
buffered saline (PBS). In all cases, the composition must be sterile and should be 
fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the 

10 contaminating action of microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (for example, glycerol, propylene glycol, and liquid polyetheylene glycol, 
and the like), and suitable mixtures thereof. The proper fluidity can be 
maintained, for example, by the use of a coating such as lecithin, by the 

1 5 maintenance of the required particle size in the case of dispersion and by the use 
of surfactants. Prevention of the action of microorganisms can be achieved by 
various antibacterial and antifungal agents, for example, parabens. chlorobutanol. 
phenol, ascorbic acid, thimerosal. and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars, polyalcohols such as 

2 0 mannitol. sorbitol, sodium chloride in the composition. Prolonged absorption of 

the injectable compositions can be brought about by including in the composition 
an agent which delays absorption, for example, aluminum monostearate and 
gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
2 5 compound (e.g., a CARD-3, CARD-4. CARD-5, or CARD-6 protein or 

anti-CARD-3, CARD-4. CARD-5, or CARD-6 antibody) in the required amount 
in an appropriate solvent with one or a combination of ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are 
prepared by incorporating the active compound into a sterile vehicle which 

3 0 contains a basic dispersion medium and the required other ingredients from those 
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enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and 
freeze-drying which yields a powder of the active ingredient plus any additional 
desired ingredient from a previously sterile-filtered solution thereof. 
5 Oral compositions generally include an inert diluent or an edible 

carrier. They can be enclosed in gelatin capsules or compressed into tablets. For 
the purpose of oral therapeutic administration, the active compound can be 
incorporated with excipients and used in the form of tablets, troches, or capsules. 
Oral compositions can also be prepared using a fluid carrier for use as a 

1 0 mouthwash, wherein the compound in the fluid carrier is applied orally and 
swished and expectorated or swallowed. Pharmaceutically compatible binding 
agents, and/or adjuvant materials can be included as part of the composition. The 
tablets, pills, capsules, troches and the like can contain any of the following 
ingredients, or compounds of a similar nature: a binder such as microcrystalline 

1 5 cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange flavoring. For administration by 

2 0 inhalation, the compounds are delivered in the form of an aerosol spray from 
pressured container or dispenser which contains a suitable propel lant. e.g., a gas 
such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal 
means. For transmucosal or transdermal administration, penetrants appropriate to 

2 5 the barrier to be permeated are used in the formulation. Such penetrants are 

generally known in the art, and include, for example, for transmucosal 
administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of nasal sprays or 
suppositories. For transdermal administration, the active compounds are 

3 0 formulated into ointments, salves, gels, or creams as generally known in the art. 
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The compounds can also be prepared in the form of suppositories 
(e.g., with conventional suppository bases such as cocoa butter and other 
glycerides) or retention enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers 
5 that will protect the compound against rapid elimination from the body, such as a 
controlled release formulation, including implants and microencapsulated 
delivery systems. Biodegradable, biocompatible polymers can be used, such as 
ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, 
polyorthoesters, and polylactic acid. Methods for preparation of such 

1 0 formulations will be apparent to those skilled in the art. The materials can also 
be obtained commercially from Alza Corporation and Nova Pharmaceuticals. Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with 
monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those 

1 5 skilled in the art, for example, as described in U.S. Patent No. 4,522,8 1 1 . 

It is especially advantageous to formulate oral or parenteral 
compositions in dosage unit form for ease of administration and uniformity of 
dosage. Dosage unit form as used herein refers to physically discrete units suited 
as unitary dosages for the subject to be treated; each unit containing a 

2 0 predetermined quantity of active compound calculated to produce the desired 
therapeutic effect in association with the required pharmaceutical carrier. The 
specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the 
particular therapeutic effect to be achieved, and the limitations inherent in the art 

2 5 of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of 
body weight (generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the 
brain, a dosage of 50 mg/kg to 100 mg/kg is usually appropriate. Generally, 
partially human antibodies and fully human antibodies have a longer half-life 

3 0 within the human body than other antibodies. Accordingly, lower dosages and 
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less frequent administration is often possible. Modifications such as lipidation 
can be used to stabilize antibodies and to enhance uptake and tissue penetration 
(e.g., into the brain). A method for lipidation of antibodies is described by 
Cruikshank et al. ((1997) J. Acquired Immune Deficiency Syndromes and Human 
5 Retrovirology 14:193). 

The nucleic acid molecules of the invention can be inserted into 
vectors and used as gene therapy vectors. Gene therapy vectors can be delivered 
to a subject by, for example, intravenous injection, local administration (U.S. 
Patent 5.328,470) or by stereotactic injection (see, e.g., Chen et al. (1994) Proc. 

10 Natl. Acad. Sci. USA 91 :3054-3057). The pharmaceutical preparation of the 
gene therapy vector can include the gene therapy vector in an acceptable diluent, 
or can comprise a slow release matrix in which the gene delivery vehicle is 
imbedded. Alternatively, where the complete gene delivery vector can be 
produced intact from recombinant cells, e.g. retroviral vectors, the 

1 5 pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 

The pharmaceutical compositions can be included in a container, 
pack, or dispenser together with instructions for administration. 

2 0 V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues. and 
antibodies described herein can be used in one or more of the following methods: 
a) screening assays; b) detection assays (e.g., chromosomal mapping, tissue 
typing, forensic biology), c) predictive medicine (e.g., diagnostic assays, 

2 5 prognostic assays, monitoring clinical trials, and pharmacogenomics); and d) 

methods of treatment (e.g., therapeutic and prophylactic). A CARD-3, CARD-4. 
CARD-5, or CARD-6 protein interacts with other cellular proteins and can thus 
be used for (i) regulation of cellular proliferation; (ii) regulation of cellular 
differentiation: and (in) regulation of cell survival. The isolated nucleic acid 

3 0 molecules of the invention can be used to express CARD-3, CARD-4, CARD-5, 
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or CARD-6 protein (e.g., via a recombinant expression vector in a host cell in 
gene therapy applications), to detect CARD-3, CARD-4, CARD-5, or CARD-6 
mRNA (e.g., in a biological sample) or a genetic lesion in a CARD-3. CARD-4, 
CARD-5, or CARD-6 gene, and to modulate CARD-3, CARD-4, CARD-5, or 
5 CARD-6 activity. In addition, the CARD-3. CARD-4, CARD-5, or CARD-6 
proteins can be used to screen drugs or compounds which modulate the CARD-3. 
CARD-4, CARD-5, or CARD-6 activity or expression as well as to treat 
disorders characterized by insufficient or excessive production of CARD-3, 
CARD-4, CARD-5. or CARD-6 protein or production of CARD-3, CARD-4. 

1 0 CARD-5, or CARD-6 protein forms which have decreased or aberrant activity 
compared to CARD-3, CARD-4, CARD-5, or CARD-6 wild type protein. In 
addition, the anti-CARD-3, CARD-4, CARD-5, or CARD-6 antibodies of the 
invention can be used to detect and isolate CARD-3, CARD-4, CARD-5, or 
CARD-6 proteins and modulate CARD-3, CARD-4, CARD-5, or CARD-6 

1 5 activity. 

This invention further pertains to novel agents identified by the above- 
described screening assays and uses thereof for treatments as described herein. 

A. Screening Assays 
2 0 The invention provides a method (also referred to herein as a 

"screening assay") for identifying modulators, i.e., candidate or test compounds 
or agents (e.g., peptides, peptidomimetics, small molecules or other drugs) which 
bind to CARD-3, CARD-4, CARD-5. or CARD-6 proteins or biologically active 
portions thereof or have a stimulatory or inhibitory effect on. for example. 

2 5 CARD-3. CARD-4, CARD-5. or CARD-6 expression or CARD-3, CARD-4, 

CARD-5, or CARD-6 activity. An example of a biologically active portion of 
human CARD-4 is amino acids 1-145 encoding the CARD domain which is 
sufficient to exhibit CARD-3-binding activity as described in Example 7. Amino 
acids 406-953 of human CARD-4L comprising the leucine rich repeat domain 

3 0 represent a biologically active portion of CARD-4L because they possess 
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hNUDC-binding activity as described in Example 8. An example of a 
biologically active portion of human CARD-5 is amino acids 1 1 1-881 (SEQ ID 
NO: 58) encoding the CARD domain. 

Among the screening assays provided by the invention are screening 
5 to identify molecules that prevent the dimerization of a CARD- containing 
polypeptide of the invention, screening to identify molecules which block the 
binding of a CARD containing polypeptide to a CARD-containing polypeptide of 
the invention (e.g., CARD-4), screening to identify a competitive inhibitor of the 
binding of a nucleotide to the nucleotide binding site of a CARD-containing 

10 polypeptide of the invention, e.g., human CARD-4L, screening to identify 
compounds which block the interaction between the leucine-rich repeat of a 
CARD-containing polypeptide of the invention and a ligand which binds to the 
leucine-rich repeat. 

For CARD-6 screening assays can be used to identify molecules 

1 5 which modulate a CARD-6 mediated increase in transcription of genes having an 
AP-1 OR nf-KB binding site. For example, expression of a reporter under the 
control of NF-kB (or AP-1) is measured in the presence and absence of a 
candidate molecule and in the presence and absence of CARD-6 to identify those 
molecules which alter expression of the reporter in a CARD-6 dependent manner. 

2 0 In addition, screening assays can be used to identify molecules which modulate a 
CARD-6 mediated increase in CHOP phosphorylation. For example, the 
expression of a reporter gene under the control of CHOP is measured in the 
presence and absence of a candidate small molecule and in the presence and 
absence of CARD-6 to identify those molecules which alter expression of the 

2 5 reporter in a CARD-6 dependent manner. A screening assay can be carried out to 

identify molecules which modulate the CARD-6 mediated increase in CHOP 
phosphorylation. For example, CHOP phosphorylation is measured in the 
presence and absence of a candidate molecule and in the presence and absence of 
CARD-6. Phosphorylation of CHOP can be measured using an antibody which 

3 0 binds to phosphorylated CHOP, but not to non-phosphorylated CHOP. 
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In one embodiment, the invention provides assays for screening 
candidate or test compounds which bind to or modulate the activity of a CARD- 
3, CARD-4, CARD-5, or CARD-6 proteins or polypeptides or biologically active 
portions thereof. The test compounds of the present invention can be obtained 
5 using any of the numerous approaches in combinatorial library methods known in 
the art, including: biological libraries; spatially addressable parallel solid phase or 
solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using 
affinity chromatography selection. The biological library approach is limited to 

1 0 peptide libraries, while the other four approaches are applicable to peptide, non- 
peptide oligomer or small molecule libraries of compounds (Lam (1997) 
Anticancer Drug Des. 12:145). Examples of methods for the synthesis of 
molecular libraries can be found in the art, for example in: DeWitt et al. (1993) 
Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et al. (1994) Proc. Natl. Acad. Sci. 

15 USA 91:1 1422; Zuckermann et al. (1994). J. Med. Chem. 37:2678; Cho et al. 
(1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. Int. Ed. Engl. 
33:2059: Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et 
al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten 

20 (1992) Bio/Techniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84). 
chips (Fodor (1993) Nature 364:555-556). bacteria (U.S. Patent No. 5.223.409). 
spores (Patent Nos. 5.571,698; 5,403,484; and 5,223.409), plasmids (Cull et al. 
( 1 992) Proc. Natl. Acad. Sci. USA 89: 1 865-1 869) or on phage (Scott and Smith 
(1990) Science 249:386-390; Devlin (1990) Science 249:404-406; Cwirla et al. 

25 (1990) Proc. Natl. Acad. Sci. 87:6378-6382; and Felici (1991) J. Mol. Biol. 
222:301-310). 

Determining the ability of the test compound to modulate the activity 
of CARD-3. CARD-4. CARD-5. or CARD-6 or a biologically active portion 
thereof can be accomplished, for example, by determining the ability of the 
3 0 CARD-3. CARD-4, CARD-5. or CARD-6 protein to bind to or interact with a 
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CARD-3, CARD-4, CARDS, or CARD-6 target molecule. As used herein, a 
"target molecule" is a molecule with which a CARD-3, CARD-4, CARD-5, or 
CARD-6 protein binds or interacts in nature, for example, a molecule associated 
with the internal surface of a cell membrane or a cytoplasmic molecule. A 
5 CARD-3, CARD-4, CARD-5, or CARD-6 target molecule can be a non-CARD- 
3, CARD-4, CARD-5. or CARD-6 molecule or a CARD-3, CARD-4, CARD-5, 
or CARD-6 protein or polypeptide of the present invention. In one embodiment, 
a CARD-3, CARD-4. CARD-5, or CARD-6 target molecule is a component of an 
apoptotic signal transduction pathway, e.g., CARD-3 and CARD-4. The target, 

1 0 for example, can be a second intracellular protein which has catalytic activity or a 
protein which facilitates the association of downstream signaling molecules with 
CARD-3, CARD-4, CARD-5, or CARD-6. In another embodiment, CARD-3, 
CARD-4, CARD-5, or CARD-6 target molecules include CARD-3 because 
CARD-3 was found to bind to CARD-4 (Examples 7 and 12) and hNUDC 

15 because hNUDC was found to bind to CARD-4 (Example 8). 

Determining the ability of the test compound to modulate the activity 
of CARD-3. CARD-4, CARD-5. or CARD-6 or a biologically active portion 
thereof can be accomplished, for example, by determining the ability of the 
CARD-3, CARD-4. CARD-5. or CARD-6 protein to bind to or interact with any 

2 0 of the specific proteins listed in the previous paragraph as CARD-3, CARD-4, 
CARD-5. or CARD-6 target molecules. In another embodiment, CARD-3. 
CARD-4, CARD-5. or CARD-6 target molecules include all proteins that bind to 
a CARD-3, CARD-4, CARD-5, or CARD-6 protein or a fragment thereof in a 
two-hybrid system binding assay which can be used without undue 

2 5 experimentation to isolate such proteins from cDNA or genomic two-hybrid 

system libraries. For example, Example 7 describes the use of the CARD-4 
CARD domain region to identify CARD-3 in a two-hybrid screen and Example 8 
describes the use of the CARD-4 leucine rich repeat domain region to identify 
hNUDC in a two-hybrid screen. The binding assays described in this section can 

3 0 be cell-based or cell free (described subsequently). 
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Determining the ability of the CARD-3, CARD-4, CARD-5, or 
CARD-6 protein to bind to or interact with a CARD-3, CARD-4, CARD-5, or 
CARD-6 target molecule can be accomplished by one of the methods described 
above for determining direct binding. In an embodiment, determining the ability 
5 of the CARD-3, CARD-4, CARD-5, or CARD-6 protein to bind to or interact 
with a CARD-3, CARD-4, CARD-5, or CARD-6 target molecule can be 
accomplished by determining the activity of the target molecule. For example, 
the activity of the target molecule can be determined by detecting induction of a 
cellular second messenger of the target (e.g., intracellular Ca2+, diacylglycerol, 

1 0 IP3. etc.), detecting catalytic/enzymatic activity of the target on an appropriate 
substrate, detecting the induction of a reporter gene (e.g., a CARD-3. CARD-4, 
CARD-5, or CARD-6-responsive regulatory element operatively linked to a 
nucleic acid encoding a detectable marker, e.g. luciferase), or detecting a cellular 
response, for example, cell survival, cellular differentiation, or cell proliferation. 

1 5 For example, in Example 12 CARD-4 is shown to bind to CARD-3 and in 
Example 10, by monitoring a cellular response, CARD-4 is shown to enhance 
caspase 9 activity, cell death or apoptosis. Because CARD-3 and CARD-4 
enhance caspase 9 activity, activity can be monitored by assaying the caspase 9- 
mediated apoptosis cellular response or caspase 9 enzymatic activity. In addition. 

2 0 and in another embodiment, genes induced by CARD-3, CARD-4, CARD-5, or 
CARD-6 expression can be identified by expressing CARD-3, CARD-4, CARD- 
5. or CARD-6 in a cell line and conducting a transcriptional profiling experiment 
wherein the mRNA expression patterns of the cell line transformed with an 
empty expression vector and the cell line transformed with a CARD-3, CARD-4, 

2 5 CARD-5. or CARD-6 expression vector are compared. The promoters of genes 

induced by CARD-3, CARD-4. CARD-5, or CARD-6 expression can be 
operatively linked to reporter genes suitable for screening such as luciferase, 
secreted alkaline phosphatase, or beta-galactosidase and the resulting constructs 
could be introduced into appropriate expression vectors. A recombinant cell line 

3 0 containing CARD-3 , CARD-4, CARD-5, or CARD-6 and transfected with an 
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expression vector containing a CARD-3, CARD-4, CARD-5, or CARD-6 
responsive promoter operatively linked to a reporter gene can be used to identify 
test compounds that modulate CARD-3, CARD-4, CARD-5, or CARD-6 activity 
by assaying the expression of the reporter gene in response to contacting the 
5 recombinant cell line with test compounds. CARD-3, CARD-4, CARD-5, or 
CARD-6 agonists can be identified as increasing the expression of the reporter 
gene and CARD-3, CARD-4, CARD-5, or CARD-6 antagonists can be identified 
as decreasing the expression of the reporter gene. 

In another embodiment of the invention, the ability of a test 

1 0 compound to modulate the activity of CARD-3, CARD-4, or biologically active 
portions thereof can be determined by assaying the ability of the test compound 
to modulate CARD-3. CARD-4, CARD-5, or CARD-6-dependent pathways or 
processes where the CARD-3, CARD-4, CARD-5, or CARD-6 target proteins 
that mediate the CARD-3, CARD-4, CARD-5, or CARD-6 effect are known or 

1 5 unknown. Potential CARD-3, CARD-4, CARD-5, or CARD-6-dependent 

pathways or processes include, but are not limited to, the modulation of cellular 
signal transduction pathways and their related second messenger molecules (e.g., 
intracellular Ca 2+ . diacylglycerol, IP3, cAMP etc.), cellular enzymatic activities, 
cellular responses (e.g.. cell survival, cellular differentiation, or cell 

2 0 proliferation), or the induction or repression of cellular or heterologous mRNAs 
or proteins. CARD-3. CARD-4, CARD-5, or CARD-6-dependent pathways or 
processes could be assayed by standard cell-based or cell free assays appropriate 
for the specific pathway or process under study. For example, Example 9 
describes how expression of CARD-4S or CARD-4L in 293T cells induces the 

2 5 NF-kB pathway as determined by the measurement of a cotransfected NF-kB 

pathway luciferase reporter gene. In another embodiment, cells cotransfected 
with CARD-4 and the NF-kB luciferase reporter gene could be contacted with a 
test compound and test compounds that block CARD-4 activity could be 
identified by their reduction of CARD-4-dependent NF-kB pathway luciferase 

3 0 reporter gene expression. Test compounds that agonize CARD-4 would be 
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expected to increase reporter gene expression. In another embodiment, CARD-4 
could be expressed in a cell line and the recombinant CARD-4-expressing cell 
line could be contacted with a test compound. Test compounds that inhibit 
CARD-4 activity could be indentified by their reduction of CARD-4-depended 
5 NF-kB pathway stimulation as measured by the assay of a NF-kB pathway 

reporter gene. NF-kB nuclear localization, IkB phosphorylation or proteolysis, or 
other standard assays for NF-kB pathway activation known to those skilled in the 
art. 

In yet another embodiment, an assay of the present invention is a cell- 
1 0 free assay comprising contacting a CARD-3, CARD-4, CARD-5, or CARD-6 
protein or biologically active portion thereof with a test compound and 
determining the ability of the test compound to bind to the CARD-3, CARD-4, 
CARD-5, or CARD-6 protein or biologically active portion thereof. Binding of 
the test compound to the CARD-3, CARD-4, CARD-5, or CARD-6 protein can 
15 be determined either directly or indirectly as described above. In one 

embodiment, a competitive binding assay includes contacting the CARD-3, 
CARD-4, CARD-5, or CARD-6 protein or biologically active portion thereof 
with a compound known to bind CARD-3. CARD-4, CARD-5. or CARD-6 to 
form an assay mixture, contacting the assay mixture with a test compound, and 
2 0 determining the ability of the test compound to interact with a CARD-3. CARD- 
4. CARD-5. or CARD-6 protein, wherein determining the ability of the test 
compound to interact with a CARD-3. CARD-4, CARD-5. or CARD-6 protein 
comprises determining the ability of the test compound to preferentially bind to 
CARD-3, CARD-4, CARD-5, or CARD-6 or biologically active portion thereof 

2 5 as compared to the known binding compound. 

In another embodiment, an assay is a cell-free assay comprising 
contacting CARD-3, CARD-4, CARD-5. or CARD-6 protein or biologically 
active portion thereof with a test compound and determining the ability of the test 
compound to modulate (e.g.. stimulate or inhibit) the activity of the CARD-3, 

3 0 CARD-4. CARD-5. or CARD-6 protein or biologically active portion thereof. 
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Determining the ability of the test compound to modulate the activity of CARD- 

3. CARD-4, CARD-5, or CARD-6 can be accomplished, for example, by 
determining the ability of the CARD-3, CARD-4, CARD-5, or CARD-6 protein 
to bind to a CARD-3, CARD-4. CARD-5, or CARD-6 target molecule by one of 

5 the methods described above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound to modulate the 
activity of CARD-3, CARD-4, CARD-5, or CARD-6 can be accomplished by 
determining the ability of the CARD-3, CARD-4, CARD-5. or CARD-6 protein 
to further modulate a CARD-3, CARD-4, CARD-5, or CARD-6 target molecule. 

10 For example, the catalytic/enzymatic activity of the target molecule on an 
appropriate substrate can be determined as previously described. 

In yet another embodiment, the cell-free assay comprises contacting 
the CARD-3, CARD-4, CARD-5, or CARD-6 protein or biologically active 
portion thereof with a known compound which binds CARD-3. CARD-4, 

15 CARD-5. or CARD-6 to form an assay mixture, contacting the assay mixture 
with a test compound, and determining the ability of the test compound to 
interact with a CARD-3, CARD-4. CARD-5. or CARD-6 protein, wherein 
determining the ability of the test compound to interact with a CARD-3. CARD- 

4, CARD-5. or CARD-6 protein comprises determining the ability of the CARD- 
2 0 3, CARD-4, CARD-5, or CARD-6 protein to preferentially bind to or modulate 

the activity of a CARD-3, CARD-4. CARD-5, or CARD-6 target molecule. The 
cell-free assays of the present invention are amenable to use of both the soluble 
form or the membrane-associated form of CARD-3, CARD-4. CARD-5, or 
CARD-6. A membrane-associated form of CARD-3, CARD-4, CARD-5, or 

2 5 CARD-6 refers to CARD-3. CARD-4, CARD-5. or CARD-6 that interacts with a 

membrane-bound target molecule. In the case of cell-free assays comprising the 
membrane-associated form of CARD-3. CARD-4, CARD-5, or CARD-6. it may- 
be desirable to utilize a solubilizing agent such that the membrane-associated 
form of CARD-3. CARD-4. CARD-5. or CARD-6 is maintained in solution. 

3 0 Examples of such solubilizing agents include non-ionic detergents such as n- 
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octylglucoside, n-dodecylglucoside. n-dodecylmaltoside, octanoyl-N- 
methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, 
Thesit®, Isotridecypoly(ethylene glycol ether)n, 3-[(3- 
cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 
5 cholamidopropyl)dimethylamminio]-2-hydroxy-l -propane sulfonate (CHAPSO), 
or N-dodecyl=N,N-dimethyI-3-ammonio- 1 -propane sulfonate. 

In more than one embodiment of the above assay methods of the 
present invention, it may be desirable to immobilize either CARD-3, CARD-4. 
CARD-5, or CARD-6 or its target molecule to facilitate separation of complexed 

10 from uncomplexed forms of one or both of the proteins, as well as to 

accommodate automation of the assay. Binding of a test compound to CARD-3, 
CARD-4, CARD-5, or CARD-6, or interaction of CARD-3, CARD-4, CARD-5. 
or CARD-6 with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the 

15 reactants. Examples of such vessels include microtitre plates, test tubes, and 
micro-centrifuge tubes. In one embodiment, a fusion protein can be provided 
which adds a domain that allows one or both of the proteins to be bound to a 
matrix. For example. glutathione-S-transferase/CARD-3, CARD-4, CARD-5, or 
CARD-6 fusion proteins or glutathione-S-transferase/target fusion proteins can 

2 0 be adsorbed onto glutathione sepharose beads (Sigma Chemical; St. Louis. MO) 
or glutathione derivatized microtitre plates, which are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or 
CARD-3, CARD-4, CARD-5, or CARD-6 protein, and the mixture incubated 
under conditions conducive to complex formation (e.g., at physiological 

2 5 conditions for salt and pH). Following incubation, the beads or microtitre plate 

wells are washed to remove any unbound components, the matrix immobilized in 
the case of beads, complex determined either directly or indirectly, for example, 
as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of CARD-3. C ARD-4. CARD-5, or CARD-6 binding or 

3 0 activity determined using standard techniques. In an alternative embodiment, 
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MYC or HA epitope tag CARD-3 or CARD-4 fusion proteins or MYC or HA 
epitope tag target fusion proteins can be adsorbed onto anti-MYC or anti-HA 
antibody coated microbeads or onto anti-MYC or anti-HA antibody coated 
microtitre plates, which are then combined with the test compound or the test 
5 compound and either the non-adsorbed target protein or CARD-3 or CARD-4 
protein, and the mixture incubated under conditions conducive to complex 
formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or microtitre plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined 

10 either directly or indirectly, for example, as described above. Alternatively, the 
complexes can be dissociated from the matrix, and the level of CARD-3 or 
CARD-4 binding or activity determined using standard techniques. Example 12 
describes an HA epitope tagged CARD-4 protein that physically interacts in a 
coimmunoprecipitation assay with MYC epitope tagged CARD-3. In an 

15 embodiment of the invention, HA epitope tagged CARD-4 could be used in 

combination with MYC epitope CARD-3 in the sort of protein-protein interaction 
assay described earlier in this paragraph. 

Other techniques for immobilizing proteins on matrices can also be 
used in the screening assays of the invention. For example, CARD-3. CARD-4. 

2 0 CARD-5. or CARD-6 or its target molecule can be immobilized utilizing 
conjugation of biotin and streptavidin. Biotinylated CARD-3 or CARD-4 or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) 
using techniques well known in the art (e.g., biotinylation kit. Pierce Chemicals; 
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates 

2 5 (Pierce Chemical). Alternatively, antibodies reactive with CARD-3, CARD-4, 

CARD-5, CARD-6 or target molecules but which do not interfere with binding of 
the protein to its target molecule can be derivatized to the wells of the plate, and 
unbound target or protein trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those described above for the GST- 

3 0 immobilized complexes and epitope tag immobilized complexes, include 
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immunodetection of complexes using antibodies reactive with the CARD-3, 
CARD-4, CARD-5, or CARD-6 or target molecule, as well as enzyme-linked 
assays which rely on detecting an enzymatic activity associated with the CARD- 
S' CARD-4, CARD-5, CARD-6 or target molecule. 
5 In another embodiment, modulators of CARD-3. CARD-4, CARD-5, 

or CARD-6 expression are identified in a method in which a cell is contacted 
with a candidate compound and the expression of the CARD-3. CARD-4, 
CARD-5. or CARD-6 promoter. mRNA or protein in the cell is determined. The 
level of expression of CARD-3, CARD-4. CARD-5. or CARD-6 mRNA or 

1 0 protein in the presence of the candidate compound is compared to the level of 
expression of CARD-3. CARD-4, CARD-5. or CARD-6 mRNA or protein in the 
absence of the candidate compound. The candidate compound can then be 
identified as a modulator of CARD-3, CARD-4, CARD-5, or CARD-6 
expression based on this comparison. For example, when expression of CARD- 

15 3. CARD-4, CARD-5, or CARD-6 mRNA or protein is greater (statistically 
significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of CARD-3, 
CARD-4, CARD-5. or CARD-6 mRNA or protein expression. Alternatively, 
when expression of CARD-3, CARD-4, CARD-5, or CARD-6 mRNA or protein 

2 0 is less (statistically significantly less) in the presence of the candidate compound 

than in its absence, the candidate compound is identified as an inhibitor of 
CARD-3, CARD-4, CARD-5, or CARD-6 mRNA or protein expression. The 
level of CARD-3, CARD-4, CARD-5, or CARD-6 mRNA or protein expression 
in the cells can be determined by methods described herein for detecting CARD- 
25 3, CARD-4, CARD-5, or CARD-6 mRNA or protein. The activity of the CARD- 
3. CARD-4, CARD-5, or CARD-6 promoter can be assayed by linking the 
CARD-3, CARD-4. CARD-5, or CARD-6 promoter to a reporter gene such as 
luciferase, secreted alkaline phosphatase, or beta-galactosidase and introducing 
the resulting construct into an appropriate vector, transfecting a host cell line, and 

3 0 measuring the activity of the reporter gene in response to test compounds. For 
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example, two CARD-4-specific mRNAs were detected in a Northern blotting 
experiment, one of 4.6 kilobases and the other of 6.5-7.0 kilobases (Example 11). 
In Example 1 1, CARD-4-specific mRNA species were found to be widely 
distributed in the tissues and cell lines studied. 
5 In yet another aspect of the invention, the CARD-3, CARD-4. CARD- 

5, or CARD-6 proteins can be used as "bait proteins" in a two-hybrid assay or 
three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et al. (1993) Cell 
72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et al. 
(1993) Bio/Techniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693- 

1 0 1 696; and PCT Publication No. WO 94/1 0300), to identify other proteins, which 
bind to or interact with CARD-3, CARD-4, CARD-5, or CARD-6 ("CARD-3, 
CARD-4, CARD-5, or CARD-6-binding proteins" or "CARD-3. CARD-4, 
CARD-5, or CARD-6-bp") and modulate CARD-3, CARD-4, CARD-5. or 
CARD-6 activity. Such CARD-3, CARD-4, CARD-5, or CARD-6-binding 

1 5 proteins are also likely to be involved in the propagation of signals by the CARD- 
S' CARD-4, CARD-5, or CARD-6 proteins as. for example, upstream or 
downstream elements of the CARD-3, CARD-4, CARD-5, or CARD-6 pathway. 
For example. Example 7 describes the construction of a two-hybrid screening bait 
construct including human CARD-4L amino acids 1-145 comprising the CARD 

2 0 domain and the use of this bait construct to screen human mammary gland and 

prostate gland two-hybrid libraries resulting in the identification of human 
CARD-3 as a CARD-4 interacting protein. In another example, Example 8 
describes the construction of a two-hybrid screening bait construct including 
human CARD-4 amino acids 406-953 comprising the LRU domain and the use of 
25 this bait construct to screen a human mammary gland two-hybrid libraries 
resulting in the identification of hNUDC as a CARD-4 interacting protein. 

The two-hybrid system is based on the modular nature of most 
transcription factors, which consist of separable DNA-binding and activation 
domains. Briefly, the assay utilizes two different DNA constructs. In one 

3 0 construct, the gene that codes for CARD-3, CARD-4. CARD-5. or CARD-6 is 
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fused to a gene encoding the DNA binding domain of a known transcription 
factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library of 
DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
fused to a gene that codes for the activation domain of the known transcription 
5 factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming 
an CARD-3, CARD-4, CARD-5, or CARD-6-dependent complex, the DNA- 
binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) 
which is operably linked to a transcriptional regulatory site responsive to the 

1 0 transcription factor. Expression of the reporter gene can be detected and cell 

colonies containing the functional transcription factor can be isolated and used to 
obtain the cloned gene which encodes the protein which interacts with CARD-3. 
CARD-4, CARD-5, or CARD-6. 

In an embodiment of the invention, the ability of a test compound to 

15 modulate the activity of CARD-3, CARD-4, CARD-5, or CARD-6, or a 

biologically active portion thereof can be determined by assaying the ability of 
the test compound to block the binding of CARD-3, CARD-4, CARD-5, or 
CARD-6 to its target proteins in a two-hybrid system assay. Example 7 describes 
a two-hybrid system assay for the interaction between CARD-3 and CARD-4 and 

2 0 Example 8 describes a two-hybrid system assay for the interaction between 

CARD-4 and its target protein hNUDC. To screen for test compounds that block 
the interaction between CARD-3 and CARD-4 and their target proteins, which 
include but are not limited to CARD-3, CARD-4, and hNUDC, a yeast two- 
hybrid screening strain coexpressing the interacting bait and prey constructs, for 

2 5 example, a CARD-4 bait construct and a CARD-3 prey construct as described in 

Example 7, is contacted with the test compound and the activity of the two- 
hybrid system reporter gene, usually HIS3, lacZ, or URA3 is assayed. If the 
strain remains viable but exhibits a significant decrease in reporter gene activity, 
this would indicate that the test compound has inhibited the interaction between 

3 0 the bait and prey proteins. This assay could be automated for high throughput 
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drug screening purposes. In another embodiment of the invention,CARD-3. 
CARD-4, CARD-5, or CARD-6 and their target proteins could be configured in 
the reverse two-hybrid system (Vidal et al. (1996) Proc. Natl. Acad. Sci. USA 
93:10321-6 and Vidal et al. (1996) Proc. Natl. Acad. Sci. USA 93:10315-20) 
5 designed specifically for efficient drug screening. In the reverse two-hybrid 
system, inhibition of a CARD-3 or CARD-4 physical interaction with a target 
protein would result in induction of a reporter gene in contrast to the normal two- 
hybrid system where inhibition of CARD-3, CARD-4, CARD-5, or CARD-6 
physical interaction with a target protein would lead to reporter gene repression. 

1 0 The reverse two-hybrid system is preferred for drug screening because reporter 
gene induction is more easily assayed than report gene repression. 

Alternative embodiments of the invention are proteins found to 
physically interact with proteins that bind to CARD-3, CARD-4, CARD-5. or 
CARD-6. CARD-3, CARD-4. CARD-5. or CARD-6 interactors, including but 

1 5 not limited to hNUDC and CARD-3, could be configured into two-hybrid system 
baits and used in two-hybrid screens to identify additional members of the 
CARD-3, CARD-4, CARD-5, or CARD-6 pathway. The interactors of CARD-3. 
CARD-4. CARD-5, or CARD-6 interactors identified in this way could be useful 
targets for therapeutic intervention in CARD-3. CARD-4. CARD-5. or CARD-6 

2 0 related diseases and pathologies and an assay of their enzymatic or binding 

activity could be useful for the identification of test compounds that modulate 
CARD-3, CARD-4, CARD-5, or CARD-6 activity. 

This invention further pertains to novel agents identified by the above- 
described screening assays and uses thereof for treatments as described herein. 

25 

B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and 
the corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map 

3 0 their respective genes on a chromosome; and. thus, locate gene regions associated 
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with genetic disease; (ii) identify an individual from a minute biological sample 
(tissue typing); and (iii) aid in forensic identification of a biological sample. 
These applications are described in the subsections below. 
1 . Chromosome Mapping 
5 Once the sequence (or a portion of the sequence) of a gene has been 

isolated, this sequence can be used to map the location of the gene on a 
chromosome. Accordingly, CARD-3, CARD-4, CARD-5, or CARDS nucleic 
acid molecules described herein or fragments thereof, can be used to map the 
location of CARD-3. CARD-4, CARD-5, or CARD-6 genes on a chromosome. 

1 0 The mapping of the CARD-3, CARD-4, CARD-5, or CARD-6 sequences to 

chromosomes is an important first step in correlating these sequences with genes 
associated with disease. 

Briefly, CARD-3, CARD-4, CARD-5, or CARD-6 genes can be 
mapped to chromosomes by preparing PCR primers (preferably 1 5-25 bp in 

1 5 length) from the CARD-3, CARD-4, CARD-5, or CARD-6 sequences. 

Computer analysis of CARD-3, CARD-4. CARD-5, or CARD-6 sequences can 
be used to rapidly select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can 
then be used for PCR screening of somatic cell hybrids containing individual 

2 0 human chromosomes. Only those hybrids containing the human gene 

corresponding to the CARD-3, CARD-4, CARD-5, or CARD-6 sequences will 
yield an amplified fragment. For example, in Example 6, human CARD-4- 
specific PCR primers were used to screen DNAs from a somatic cell hybrid panel 
showing that human CARD-4 maps to chromosome 7 close to the SHGC-3 1928 

2 5 genetic marker. 

Somatic cell hybrids are prepared by fusing somatic cells from 
different mammals (e.g.. human and mouse cells). As hybrids of human and 
mouse cells grow and divide, they gradually lose human chromosomes in random 
order, but retain the mouse chromosomes. By using media in which mouse cells 

3 0 cannot grow, because they lack a particular enzyme, but human cells can, the one 
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human chromosome that contains the gene encoding the needed enzyme, will be 
retained. By using various media, panels of hybrid cell lines can be established. 
Each cell line in a panel contains either a single human chromosome or a small 
number of human chromosomes, and a full set of mouse chromosomes, allowing 
5 easy mapping of individual genes to specific human chromosomes. (D'Eustachio 
et al. (1983) Science 220:919-924). Somatic cell hybrids containing only 
fragments of human chromosomes can also be produced by using human 
chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for 

1 0 assigning a particular sequence to a particular chromosome. Three or more 

sequences can be assigned per day using a single thermal cycler. Using the 
CARD-3, CARD-4. CARD-5, or CARD-6 sequences to design oligonucleotide 
primers, sublocalization can be achieved with panels of fragments from specific 
chromosomes. Other mapping strategies which can similarly be used to map a 
1 5 CARD-3, CARD-4, CARD-5, or CARD-6 sequence to its chromosome include in 
situ hybridization (described in Fan et al. (1990) Proc. Natl. Acad. Sci. USA 
87:6223-27), pre-screening with labeled flow-sorted chromosomes, and pre- 
selection by hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a 

2 0 mctaphase chromosomal spread can further be used to provide a precise 

chromosomal location in one step. Chromosome spreads can be made using cells 
whose division has been blocked in metaphase by a chemical like colcemid that 
disrupts the mitotic spindle. The chromosomes can be treated briefly with 
trypsin, and then stained with Giemsa. A pattern of light and dark bands 

2 5 develops on each chromosome, so that the chromosomes can be identified 

individually. The FISH technique can be used with a DNA sequence as short as 
500 or 600 bases. However, clones larger than 1.000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal 
intensity for simple detection. Preferably 1 .000 bases, and more preferably 2.000 

3 0 bases will suffice to get good results at a reasonable amount of time. For a 
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review of this technique, see Verma et al., (Human Chromosomes: A Manual of 
Basic Techniques (Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used individually to mark a 
single chromosome or a single site on that chromosome, or panels of reagents can 
5 be used for marking multiple sites and/or multiple chromosomes. Reagents 
corresponding to noncoding regions of the genes actually are preferred for 
mapping purposes. Coding sequences are more likely to be conserved within 
gene families, thus increasing the chance of cross hybridizations during 
chromosomal mapping. 

1 0 Once a sequence has been mapped to a precise chromosomal location, 

the physical position of the sequence on the chromosome can be correlated with 
genetic map data. (Such data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man, available on-line through Johns Hopkins 
University Welch Medical Library). The relationship between genes and disease, 

15 mapped to the same chromosomal region, can then be identified through linkage 
analysis (co-inheritance of physically adjacent genes), described in, e.g.. Egeland 
et al. (1987) Nature. 325:783-787. 

Moreover, differences in the DNA sequences between individuals 
affected and unaffected with a disease associated with the CARD-3. CARD-4, 

2 0 CARD-5, or CARD-6 gene can be determined. If a mutation is observed in some 
or all of the affected individuals but not in any unaffected individuals, then the 
mutation is likely to be the causative agent of the particular disease. Comparison 
of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes such as deletions or translocations that 

2 5 are visible from chromosome spreads or detectable using PCR based on that 
DNA sequence. Ultimately, complete sequencing of genes from several 
individuals can be performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 
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2. Tissue Typing 

The CARD-3. CARD-4. CARD-5, or CARD-6 sequences of the 
present invention can also be used to identify individuals from minute biological 
samples. The United States military, for example, is considering the use of 
5 restriction fragment length polymorphism (RFLP) for identification of its 

personnel. In this technique, an individual's genomic DNA is digested with one 
or more restriction enzymes, and probed on a Southern blot to yield unique bands 
for identification. This method does not suffer from the current limitations of 
"Dog Tags" which can be lost, switched, or stolen, making positive identification 

10 difficult. The sequences of the present invention are useful as additional DNA 
markers for RFLP (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to 
provide an alternative technique which determines the actual base-by-base DNA 
sequence of selected portions of an individual's genome. Thus, the CARD-3, 

1 5 CARD-4, CARD-5, or CARD-6 sequences described herein can be used to 

prepare two PCR primers from the 5' and 3' ends of the sequences. These primers 
can then be used to amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in 
this manner, can provide unique individual identifications, as each individual will 

2 0 have a unique set of such DNA sequences due to allelic differences. The 
sequences of the present invention can be used to obtain such identification 
sequences from individuals and from tissue. The CARD-3. CARD-4, CARD-5. 
or CARD-6 sequences of the invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the coding regions of these 

2 5 sequences, and to a greater degree in the noncoding regions. It is estimated that 

allelic variation between individual humans occurs with a frequency of about 
once per each 500 bases. Each of the sequences described herein can. to some 
degree, be used as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater numbers of 

3 0 polymorphisms occur in the noncoding regions, fewer sequences are necessary to 
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differentiate individuals. The noncoding sequences of SEQ ID NO:l. SEQ ID 
NO:7, SEQ ID NO:25, SEQ IDNO:38. SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
NO:48, SEQ ID NO:51. SEQ ID NO:54, and SEQ ID NO:60 can comfortably 
provide positive individual identification with a panel of perhaps 10 to 1.000 
5 primers which each yield a noncoding amplified sequence of 1 00 bases. If 

predicted coding sequences, such as those in SEQ ID NO:3, SEQ ID NO:9, SEQ 
ID NO:27, SEQ ID NO:50, SEQ ID NO:53, SEQ ID NO:56, and SEQ ID NO:62 
are used, a more appropriate number of primers for positive individual 
identification would be 500-2.000. 

10 If a panel of reagents from CARD-3. CARD-4. CARD-5, or CARD-6 

sequences described herein is used to generate a unique identification database 
for an individual, those same reagents can later be used to identify tissue from 
that individual. Using the unique identification database, positive identification 
of the individual, living or dead, can be made from extremely small tissue 

15 samples. 

3. Use of Partial Sequences in Forensic Biology 
DNA-based identification techniques can also be used in forensic 
biology. Forensic biology is a scientific field employing genetic typing of 
biological evidence found at a crime scene as a means for positively identifying, 

2 0 for example, a perpetrator of a crime. To make such an identification, PCR 
technology can be used to amplify DNA sequences taken from very small 
biological samples such as tissues, e.g.. hair or skin, or body fluids, e.g., blood, 
saliva, or semen found at a crime scene. The amplified sequence can then be 
compared to a standard, thereby allowing identification of the origin of the 

2 5 biological sample. 

The sequences of the present invention can be used to provide 
polynucleotide reagents, e.g., PCR primers, targeted to specific loci in the human 
genome, which can enhance the reliability of DNA-based forensic identifications 
by, for example, providing another "identification marker" (i.e. another DNA 

30 sequence that is unique to a particular individual). As mentioned above, actual 
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base sequence information can be used for identification as an accurate 
alternative to patterns formed by restriction enzyme generated fragments. 
Sequences targeted to noncoding regions of SEQ ID NO:l. SEQ ID NO:7. SEQ 
ID NO:25, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:48. 
5 SEQ ID NO:5 1, SEQ ID NO: 54, and SEQ ID NO:60 are particularly appropriate 
for this use as greater numbers of polymorphisms occur in the noncoding regions, 
making it easier to differentiate individuals using this technique. Examples of 
polynucleotide reagents include the CARD-3. CARD-4, CARD-5, or CARD-6 
sequences or portions thereof, e.g., fragments derived from the noncoding regions 

10 of SEQ ID NO: 1 . SEQ ID NO:7, SEQ ID NO:25, SEQ ID NO:38, SEQ ID 
NO:40. SEQ ID NO:42. SEQ ID NO:48. SEQ ID NO:51, SEQ ID NO:54. and 
SEQ ID NO:60 which have a length of at least 20 or 30 bases. 

The sequences described herein can further be used to provide 
polynucleotide reagents, e.g., labeled or labelable probes which can be used in. 

15 for example, an in situ hybridization technique, to identify a specific tissue, e.g.. 
brain tissue. This can be very useful in cases where a forensic pathologist is 
presented with a tissue of unknown origin. Panels of such CARD-3. CARD-4. 
CARD-5, or CARD-6 probes can be used to identify tissue by species and/or by 
organ type. 

2 0 In a similar fashion, these reagents, e.g.. CARD-3, CARD-4. CARD- 

5. or CARD-6 primers or probes can be used to screen tissue culture for 
contamination (i.e., screen for the presence of a mixture of different types of cells 
in a culture). 

2 5 C. Predictive Medicine 

The present invention also pertains to the field of predictive medicine 
in which diagnostic assays, prognostic assays, pharmacogenomics, and 
monitoring clinical trials are used for prognostic (predictive) purposes to thereby 
treat an individual prophylactically. Accordingly, one aspect of the present 

3 0 invention relates to diagnostic assays for determining CARD-3, CARD-4, 
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CARD-5, or CARD-6 protein and/or nucleic acid expression as well as CARD-3, 
CARD-4, CARD-5. or CARD-6 activity, in the context of a biological sample 
(e.g., blood, serum, cells, tissue) to thereby determine whether an individual is 
afflicted with a disease or disorder, or is at risk of developing a disorder, 
5 associated with aberrant CARD-3, CARD-4, CARD-5, or CARD-6 expression or 
activity. The invention also provides for prognostic (or predictive) assays for 
determining whether an individual is at risk of developing a disorder associated 
with CARD-3, CARD-4, CARD-5. or CARD-6 protein, nucleic acid expression 
or activity. For example, mutations in a CARD-3. CARD-4. CARD-5, or CARD- 

10 6 gene can be assayed in a biological sample. Such assays can be used for 
prognostic or predictive purpose to thereby prophylactically treat an individual 
prior to the onset of a disorder characterized by or associated with CARD-3, 
CARD-4, CARD-5. or CARD-6 protein, nucleic acid expression or activity. 

Another aspect of the invention provides methods for determining 

15 CARD-3, CARD-4, CARD-5, or CARD-6 protein, nucleic acid expression or 
CARD-3, CARD-4. CARD-5. or CARD-6 activity in an individual to thereby 
select appropriate therapeutic or prophylactic agents for that individual (referred 
to herein as "pharmacogenomics"). Pharmacogenomics allows for the selection 
of agents (e.g., drugs) for therapeutic or prophylactic treatment of an individual 

2 0 based on the genotype of the individual (e.g.. the genotype of the individual 
examined to determine the ability of the individual to respond to a particular 
agent.) 

Yet another aspect of the invention pertains to monitoring the 
influence of agents (e.g., drugs or other compounds) on the expression or activity 

2 5 of CARD-3. CARD-4, CARD-5, or CARD-6 in clinical trials. 

These and other agents are described in further detail in the following 

sections. 

1 . Diagnostic Assays 

An exemplary method for detecting the presence or absence of 

3 0 CARD-3, CARD-4. CARD-5. or CARD-6 in a biological sample involves 
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obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting CARD-3, CARD-4, 
CARD-5, or CARD-6 protein or nucleic acid (e.g., mRNA, genomic DNA) that 
encodes CARD-3, CARD-4, CARD-5, or CARD-6 protein such that the presence 
5 of CARD-3, CARD-4, CARD-5, or CARD-6 is detected in the biological sample. 
An agent for detecting CARD-3, CARD-4, CARD-5. or CARD-6 mRNA or 
genomic DNA is a labeled nucleic acid probe capable of hybridizing to CARD-3, 
CARD-4, CARD-5, or CARD-6 mRNA or genomic DNA. The nucleic acid 
probe can be. for example, a full-length CARD-3, CARD-4, CARD-5, or CARD- 

10 6 nucleic acid, such as the nucleic acid of SEQ ID NO:l or 3. SEQ ID NO:7 or 9. 
SEQ ID NO:25 or 27. SEQ ID NO:38. SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
NO:48 or 50, SEQ ID NO:51 or SEQ ID NO:53 or SEQ ID NO:54 or SEQ ID 
NO:56. SEQ ID NO:60 or 62. or a portion thereof, such as an oligonucleotide of 
at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 

1 5 specifically hybridize under stringent conditions to mRNA or genomic DNA, or a 
human CARD-4 splice variant such as the nucleic acid of SEQ ID NO:38 or SEQ 
ID NO:40. Other suitable probes for use in the diagnostic assays of the invention 
are described herein. For example, Example 1 1 describes the use of a nucleic 
acid probe to detect CARD-4 mRNAs in human tissues and cell lines and the 

2 0 probe used in this experiment could be used for a diagnostic assay. 

An agent for detecting CARD-3. CARD-4. CARD-5, or CARD-6 
protein can be an antibody capable of binding to CARD-3, CARD-4. CARD-5. or 
CARD-6 protein, preferably an antibody with a detectable label. Antibodies can 
be polyclonal, or more preferably, monoclonal. For example, polypeptides 

2 5 corresponding to amino acids 128-139 and 287-298 of human CARD-4L were 

used to immunize rabbits and produce polyclonal antibodies that specifically 
recognize human CARD-4L. An intact antibody, or a fragment thereof (e.g.. Fab 
or F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, 
is intended to encompass direct labeling of the probe or antibody by coupling 

3 0 (i.e., physically linking) a detectable substance to the probe or antibody, as well 
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as indirect labeling of the probe or antibody by reactivity with another reagent 
that is directly labeled. Examples of indirect labeling include detection of a 
primary antibody using a fluorescently labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that it can be detected with 
5 fluorescently labeled streptavidin. The term "biological sample" is intended to 
include tissues, cells and biological fluids isolated from a subject, as well as 
tissues, cells and fluids present within a subject. That is, the detection method of 
the invention can be used to detect CARD-3. CARD-4, CARD-5. or CARD-6 
mRNA, protein, or genomic DNA in a biological sample in vitro as well as in 

1 0 vivo. For example, in vitro techniques for detection of CARD-3, CARD-4, 
CARD-5, or CARD-6 mRNA include Northern hybridizations and in situ 
hybridizations. For example. Example 1 1 contains the use of a human CARD-4L 
nucleic acid probe for a Northern blotting analysis of mRNA species encoded by 
human CARD-4L detected in RNA samples from human tissues and cell lines. 

1 5 In vitro techniques for detection of CARD-3 or CARD-4 protein include enzyme 
linked immunosorbent assays (ELISAs). Western blots, immunoprecipitations 
and immunofluorescence. In vitro techniques for detection of CARD-3, CARD- 
4. CARD-5, or CARD-6 genomic DNA include Southern hybridizations. 
Furthermore, in vivo techniques for detection of CARD-3. CARD-4, CARD-5. or 

2 0 CARD-6 protein include introducing into a subject a labeled anti-CARD-3, 
CARD-4, CARD-5, or CARD-6 antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules 

2 5 from the test subject. Alternatively, the biological sample can contain mRNA 

molecules from the test subject or genomic DNA molecules from the test subject. 
A biological sample is a peripheral blood leukocyte sample isolated by 
conventional means from a subject. 

In another embodiment, the methods further involve obtaining a 

3 0 control biological sample from a control subject, contacting the control sample 



- 107 - 



WO 01/00826 



PCT/US00/17691 



with a compound or agent capable of detecting CARD-3, CARD-4, CARD-5, or 
CARD-6 protein, mRNA. or genomic DNA, such that the presence of CARD-3, 
CARD-4, CARD-5, or CARD-6 protein, mRNA or genomic DNA is detected in 
the biological sample, and comparing the presence of CARD-3, CARD-4, 
5 CARD-5. or CARD-6 protein. mRNA or genomic DNA in the control sample 
with the presence of CARD-3. CARD-4, CARD-5. or CARD-6 protein, mRNA 
or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of 
CARD-3, CARD-4, CARD-5, or CARD-6 in a biological sample (a test sample). 

1 0 Such kits can be used to determine if a subject is suffering from or is at increased 
risk of developing a disorder associated with aberrant expression of CARD-3, 
CARD-4. CARD-5, or CARD-6 (e.g., an immunological disorder). For example, 
the kit can comprise a labeled compound or agent capable of detecting CARD-3, 
CARD-4, CARD-5, or CARD-6 protein or mRNA in a biological sample and 

1 5 means for determining the amount of CARD-3, CARD-4, CARD-5, or CARD-6 
in the sample (e.g., an anti-CARD-3, CARD-4, CARD-5, or CARD-6 antibody or 
an oligonucleotide probe which binds to DNA encoding CARD-3, CARD-4. 
CARD-5, or CARD-6. e.g.. SEQ ID NO: 1 , SEQ ID NO:3. SEQ ID NO:7. SEQ 
ID NO:9. SEQ ID NO:25. SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40. SEQ 

2 0 ID NO:42, SEQ IS NO:48, SEQ ID NO:50. SEQ ID NO:5 1 , SEQ ID NO:53. 
SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:60, or SEQ ID NO:62). Kits may 
also include instruction for observing that the tested subject is suffering from or is 
at risk of developing a disorder associated with aberrant expression of CARD-3, 
CARD-4. CARD-5. or CARD-6 if the amount of CARD-3, CARD-4, CARD-5, 

2 5 or CARD-6 protein or mRNA is above or below a normal level. 

For antibody-based kits, the kit may comprise, for example: (1 ) a first 
antibody (e.g., attached to a solid support) which binds to CARD-3, CARD-4, 
CARD-5. or CARD-6 protein; and. optionally, (2) a second, different antibody 
which binds to CARD-3, CARD-4. CARD-5. or CARD-6 protein or the first 

3 0 antibody and is conjugated to a detectable agent. 

-108- 

BNSDOCID: <WO 0100826A2.I_> 



WO 01/00826 



PCT/US00/17691 



For oligonucleotide-based kits, the kit may comprise, for example: (1) 
a oligonucleotide, e.g.. a detectably labelled oligonucleotide, which hybridizes to 
a CARD-3, CARD-4, CARD-5. or CARD-6 nucleic acid sequence or (2) a pair of 
primers useful for amplifying a CARD-3, CARD-4, CARD-5, or CARD-6 
5 nucleic acid molecule. 

The kit may also comprise, e.g., a buffering agent, a preservative, or a 
protein stabilizing agent. The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a substrate). The kit may also 
contain a control sample or a series of control samples which can be assayed and 

10 compared to the test sample contained. Each component of the kit is usually 
enclosed within an individual container and all of the various containers are 
within a single package along with instructions for observing whether the tested 
subject is suffering from or is at risk of developing a disorder associated with 
aberrant expression of CARD-3, CARD-4, CARD-5. or CARD-6. 

15 2. Prognostic Assays 

The methods described herein can furthermore be utilized as 
diagnostic or prognostic assays to identify subjects having or at risk of 
developing a disease or disorder associated with aberrant CARD-3, CARD-4, 
CARD-5. or CARD-6 expression or activity. For example, the assays described 

2 0 herein, such as the preceding diagnostic assays or the following assays, can be 
utilized to identify a subject having or at risk of developing a disorder associated 
with CARD-3, CARD-4, CARD-5. or CARD-6 protein, nucleic acid expression 
or activity. Alternatively, the prognostic assays can be utilized to identify a 
subject having or at risk for developing such a disease or disorder. Thus, the 

2 5 present invention provides a method in which a test sample is obtained from a 

subject and CARD-3. CARD-4, CARD-5, or CARD-6 protein or nucleic acid 
(e.g., mRNA, genomic DNA) is detected, wherein the presence of CARD-3, 
CARD-4, CARD-5, or CARD-6 protein or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder associated with aberrant 

3 0 CARD-3, CARD-4. CARD-5, or CARD-6 expression or activity. As used 
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herein, a "test sample" refers to a biological sample obtained from a subject of 
interest. For example, a test sample can be a biological fluid (e.g.. serum), cell 
sample, or tissue. Furthermore, the prognostic assays described herein can be 
used to determine whether a subject can be administered an agent (e.g.. an 
5 agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small 

molecule, or other drug candidate) to treat a disease or disorder associated with 
aberrant CARD-3, CARD-4, CARD-5, or CARD-6 expression or activity. For 
example, such methods can be used to determine whether a subject can be 
effectively treated with a specific agent or class of agents (e.g., agents of a type 

1 0 which decrease CARD-3. CARD-4. CARD-5. or CARD-6 activity). Thus, the 
present invention provides methods for determining whether a subject can be 
effectively treated with an agent for a disorder associated with aberrant CARD-3. 
CARD-4, CARD-5, or CARD-6 expression or activity in which a test sample is 
obtained and CARD-3, CARD-4, CARD-5. or CARD-6 protein or nucleic acid is 

15 detected (e.g., wherein the presence of CARD-3, CARD-4. CARD-5, or CARD-6 
protein or nucleic acid is diagnostic for a subject that can be administered the 
agent to treat a disorder associated with aberrant CARD-3. CARD-4, CARD-5, or 
CARD-6 expression or activity). 

The methods of the invention can also be used to detect genetic 

2 0 lesions or mutations in a CARD-3, CARD-4, CARD-5, or CARD-6 gene, thereby 
determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In preferred 
embodiments, the methods include detecting, in a sample of cells from the 
subject, the presence or absence of a genetic lesion characterized by at least one 

2 5 of an alteration affecting the integrity of a gene encoding a CARD-3, CARD-4, 

CARD-5. or CARD-6-protein. or the mis-expression of the CARD-3. CARD-4. 
CARD-5, or CARD-6 gene. For example, such genetic lesions can be detected 
by ascertaining the existence of at least one of 1 ) a deletion of one or more 
nucleotides from a CARD-3. CARD-4. CARD-5. or CARD-6 gene; 2) an 

3 0 addition of one or more nucleotides to a CARD-3. CARD-4, CARD-5, or CARD- 
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6 gene; 3) a substitution of one or more nucleotides of a CARD-3, CARD-4, 
CARD-5, or CARD-6 gene; 4) a chromosomal rearrangement of a CARD-3, 
CARD-4, CARD-5, or CARD-6 gene; 5) an alteration in the level of a messenger 
RNA transcript of a CARD-3, CARD-4, CARD-5, or CARD-6 gene; 6) aberrant 
5 modification of a CARD-3, CARD-4, CARD-5, or CARD-6 gene, such as of the 
methylation pattern of the genomic DNA; 7) the presence of a non-wild type 
splicing pattern of a messenger RNA transcript of a CARD-3. CARD-4, CARD- 
5. or CARD-6 gene (e.g. caused by a mutation in a splice donor or splice acceptor 
site); 8) a non-wild type level of a CARD-3, CARD-4. CARD-5, or CARD-6- 

1 0 protein; 9) allelic loss of a CARD-3, CARD-4, CARD-5, or CARD-6 gene; and 
10) inappropriate post-translational modification of a CARD-3, CARD-4, 
CARD-5. or CARD-6-protein. As described herein, there are a large number of 
assay techniques known in the art which can be used for detecting lesions in a 
CARD-3. CARD-4, CARD-5, or CARD-6 gene. A biological sample is a 

15 peripheral blood leukocyte sample isolated by conventional means from a 
subject. 

In certain embodiments, detection of the lesion involves the use of a 
probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively. 
2 0 in a ligation chain reaction (LCR) (see. e.g.. Landegran et al. ( 1 988) Science 

241 ; 1077-1080; and Nakazawa et al. (1994) Proc. Natl. Acad. Sci. USA 91:360- 
364). the latter of which can be particularly useful for detecting point mutations 
in the CARD-3 or CARD-4-gene (see. e.g., Abravaya et al. (1995) Nucleic Acids 
Res. 23:675-682). This method can include the steps of collecting a sample of 

2 5 cells from a patient, isolating nucleic acid (e.g., genomic. mRNA or both) from 

the cells of the sample, contacting the nucleic acid sample with one or more 
primers which specifically hybridize to a CARD-3. CARD-4. CARD-5. or 
CARD-6 gene under conditions such that hybridization and amplification of the 
CARD-3, CARD-4. CARD-5, or CARD-6-gene (if present) occurs, and detecting 

3 0 the presence or absence of an amplification product, or detecting the size of the 

- Ill - 



BNSDOCIO: <WO 0100826A2_I_> 



WO 01/00826 



PCTAJSOO/17691 



amplification product and comparing the length to a control sample. It is 
anticipated that PCR and/or LCR may be desirable to use as a preliminary 
amplification step in conjunction with any of the techniques used for detecting 
mutations described herein. 
5 Alternative amplification methods include: self sustained sequence 

replication (Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), 
transcriptional amplification system (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. 
USA 86:1 173-1 1 77), Q-Beta Replicase (Lizardi et al. (1988) Bio/Technology 
6:1 197), or any other nucleic acid amplification method, followed by the 

1 0 detection of the amplified molecules using techniques well known to those of 
skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in a CARD-3, CARD-4, 
CARD-5, or CARD-6 gene from a sample cell can be identified by alterations in 

1 5 restriction enzyme cleavage patterns. For example, sample and control DNA is 
isolated, amplified (optionally), digested with one or more restriction 
endonucleases. and fragment length sizes are determined by gel electrophoresis 
and compared. Differences in fragment length sizes between sample and control 
DNA indicates mutations in the sample DNA. Moreover, the use of sequence 

2 0 specific ribozymes (see, e.g.. U.S. Patent No. 5.498,53 1 ) can be used to score for 
the presence of specific mutations by development or loss of a ribozyme cleavage 
site. 

In other embodiments, genetic mutations in CARD-3, CARD-4, 
CARD-5. or CARD-6 can be identified by hybridizing a sample and control 

2 5 nucleic acids, e.g., DNA or RNA. to high density arrays containing hundreds or 

thousands of oligonucleotides probes (Cronin et al. (1996) Human Mutation 
7:244-255; Kozal et al. (1996) Nature Medicine 2:753-759). For example, 
genetic mutations in CARD-3 or CARD-4 can be identified in two-dimensional 
arrays containing light-generated DNA probes as described in Cronin et al. supra. 

3 0 Briefly, a first hybridization array of probes can be used to scan through long 
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stretches of DNA in a sample and control to identify base changes between the 
sequences by making linear arrays of sequential overlapping probes. This step 
allows the identification of point mutations. This step is followed by a second 
hybridization array that allows the characterization of specific mutations by using 
5 smaller, specialized probe arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant 
gene. 

In yet another embodiment, any of a variety of sequencing reactions 
10 known in the art can be used to directly sequence the CARD-3, CARD-4. CARD- 
5. or CARD-6 gene and detect mutations by comparing the sequence of the 
sample CARD-3, CARD-4, CARD-5, or CARD-6 with the corresponding wild- 
type (control) sequence. Examples of sequencing reactions include those based 
on techniques developed by Maxam and Gilbert ((1977) Proc. Natl. Acad. Sci. 
15 USA 74:560) or Sanger ((1977) Proc. Natl. Acad. Sci. USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be 
utilized when performing the diagnostic assays ((1995) Bio/Techniques 19:448), 
including sequencing by mass spectrometry (see. e.g.. PCT Publication No. WO 
94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and Griffin et al. 
2 0 (1993) Appl. Biochem. Biotechnol. 38:147-159). 

Other methods for detecting mutations in the CARD-3 or CARD-4 
gene include methods in which protection from cleavage agents is used to detect 
mismatched bases in RNA/RNA or RNA/DNA heteroduplexes (Myers et al. 
( 1 985) Science 230: 1242). In general, the art technique of "mismatch cleavage" 

2 5 starts by providing heteroduplexes of formed by hybridizing (labeled) RNA or 

DNA containing the wild-type CARD-3. CARD-4. CARD-5. or CARD-6 
sequence with potentially mutant RNA or DNA obtained from a tissue sample. 
The double-stranded duplexes are treated with an agent which cleaves single- 
stranded regions of the duplex such as which will exist due to basepair 

3 0 mismatches between the control and sample strands. For instance, RNA/DNA 
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duplexes can be treated with RNase and DNA/DNA hybrids treated with SI 
nuclease to enzymatically digesting the mismatched regions. In other 
embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
hydroxylamine or osmium tetroxide and with piperidine in order to digest 
5 mismatched regions. After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing polyacrylamide gels to determine 
the site of mutation. See, e.g.. Cotton et al (1988) Proc. Natl Acad Sci USA 
85:4397; Saleeba et al (1992) Methods Enzymol. 217:286-295. In an 
embodiment, the control DNA or RNA can be labeled for detection. 

1 0 In still another embodiment, the mismatch cleavage reaction employs 

one or more proteins that recognize mismatched base pairs in double-stranded 
DNA (so called "DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in CARD-3. CARD-4. CARD-5, or 
CARD-6 cDNAs obtained from samples of cells. For example, the mutY enzyme 

15 of E. coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from 
HeLa cells cleaves T at G/T mismatches (Hsu et al. (1994) Carcinogenesis 
15:1657-1662). According to an exemplary embodiment, a probe based on a 
CARD-3, CARD-4, CARD-5, or CARD-6 sequence, e.g., a wild-type CARD-3. 
CARD-4. CARD-5. or"CARD-6 sequence, is hybridized to a cDNA or other 

2 0 DNA product from a test cell(s). The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any, can be detected from 
electrophoresis protocols or the like. See, e.g.. U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be 
used to identify mutations in CARD-3. CARD-4. CARD-5. or CARD-6 genes. 

2 5 For example, single strand conformation polymorphism (SSCP) may be used to 

detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids (Orita et al. (1989) Proc Natl. Acad. Sci USA: 86:2766. see also 
Cotton (1993) Mutat. Res. 285:125-144; and Hayashi (1992) Genet Anal Tech 
Appl 9:73-79). Single-stranded DNA fragments of sample and control CARD-3 

3 0 or CARD-4 nucleic acids will be denatured and allowed to renature. The 
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secondary structure of single-stranded nucleic acids varies according to sequence, 
the resulting alteration in electrophoretic mobility enables the detection of even a 
single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than 
5 DNA), in which the secondary structure is more sensitive to a change in 

sequence. In an embodiment, the subject method utilizes heteroduplex analysis 
to separate double stranded heteroduplex molecules on the basis of changes in 
electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5). 

In yet another embodiment, the movement of mutant or wild-type 

10 fragments in polyacrylamide gels containing a gradient of denaturant is assayed 
using denaturing gradient gel electrophoresis (DGGE) (Myers et al. (1985) 
Nature 3 13:495). When DGGE is used as the method of analysis, DNA will be 
modified to insure that it does not completely denature, for example by adding a 
GC clamp of approximately 40 bp of high-melting GC-rich DNA by PCR. In a 

15 further embodiment, a temperature gradient is used in place of a denaturing 
gradient to identify differences in the mobility of control and sample DNA 
(Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, 
but are not limited to, selective oligonucleotide hybridization, selective 

2 0 amplification, or selective primer extension. For example, oligonucleotide 

primers may be prepared in which the known mutation is placed centrally and 
then hybridized to target DNA under conditions which permit hybridization only 
if a perfect match is found (Saiki et al. (1986) Nature 324:163); Saiki et al. (1989) 
Proc. Natl Acad. Sci USA 86:6230). Such allele specific oligonucleotides are 
25 hybridized to PCR amplified target DNA or a number of different mutations 
when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends 
on selective PCR amplification may be used in conjunction with the instant 

3 0 invention. Oligonucleotides used as primers for specific amplification may carry 
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the mutation of interest in the center of the molecule (so that amplification 
depends on differential hybridization) (Gibbs et al. (1989) Nucleic Acids Res. 
1 7:2437-2448) or at the extreme 3' end of one primer where, under appropriate 
conditions, mismatch can prevent, or reduce polymerase extension (Prossner 
5 ( 1 993) Tibtech 1 1 :238). In addition, it may be desirable to introduce a novel 
restriction site in the region of the mutation to create cleavage-based detection 
(Gasparini et al. (1992) Mol. Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for 
amplification (Barany ( 1 99 1 ) Proc. Natl. Acad. Sci USA 88:189). In such cases, 

10 ligation will occur only if there is a perfect match at the 3' end of the 5' sequence 
making it possible to detect the presence of a known mutation at a specific site by 
looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by 
utilizing pre-packaged diagnostic kits comprising at least one probe nucleic acid 

15 or antibody reagent described herein, which may be conveniently used, e.g., in 
clinical settings to diagnose patients exhibiting symptoms or family history of a 
disease or illness involving a CARD-3. CARD-4, CARD-5, or CARD-6 gene. 

Furthermore, any cell type or tissue, preferably peripheral blood 
leukocytes, in which CARD-3 or CARD-4 is expressed may be utilized in the 

2 0 prognostic assays described herein. 

3. Pharmacogenomics 

Agents, or modulators which have a stimulatory or inhibitory effect on 
CARD-3. CARD-4, CARD-5, or CARD-6 activity (e.g., CARD-3, CARD-4, 
CARD-5, or CARD-6 gene expression) as identified by a screening assay 

2 5 described herein can be administered to individuals to treat (prophylactically or 

therapeutically) disorders (e.g., an immunological disorder) associated with 
aberrant CARD-3. CARD-4. CARD-5, or CARD-6 activity. In conjunction with 
such treatment, the pharmacogenomics (i.e., the study of the relationship between 
an individual's genotype and that individual's response to a foreign compound or 

3 0 drug) of the individual may be considered. Differences in metabolism of 
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therapeutics can lead to severe toxicity or therapeutic failure by altering the 
relation between dose and blood concentration of the pharmacologically active 
drug. Thus, the pharmacogenomics of the individual permits the selection of 
effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a 
5 consideration of the individual's genotype. Such pharmacogenomics can further 
be used to determine appropriate dosages and therapeutic regimens. 
Accordingly, the activity of CARD-3. CARD-4, CARD-5. or CARD-6 protein, 
expression of CARD-3, CARD-4. CARD-5. or CARD-6 nucleic acid, or 
mutation content of CARD-3, CARD-4, CARD-5, or CARD-6 genes in an 

1 C individual can be determined to thereby select appropriate agent(s) for therapeutic 
or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary 
variations in the response to drugs due to altered drug disposition and abnormal 
action in affected persons. See, e.g., Linder (1997) Clin. Chem. 43(2):254-266. 

1 5 In general, two types of pharmacogenetic conditions can be differentiated. 

Genetic conditions transmitted as a single factor altering the way drugs act on the 
body (altered drug action) or genetic conditions transmitted as single factors 
altering the way the body acts on drugs (altered drug metabolism). These 
pharmacogenetic conditions can occur either as rare defects or as polymorphisms. 

2 0 For example, glucose-6-phosphate dehydrogenase deficiency (G6PD) is a 
common inherited enzymopathy in which the main clinical complication is 
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, 
analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing 

2 5 enzymes is a major determinant of both the intensity and duration of drug action. 

The discovery of genetic polymorphisms of drug metabolizing enzymes (e.g., N- 
acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes CYP2D6 and 
CYP2C1 9) has provided an explanation as to why some patients do not obtain the 
expected drug effects or show exaggerated drug response and serious toxicity 

3 0 after taking the standard and safe dose of a drug. These polymorphisms are 
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expressed in two phenotypes in the population, the extensive metabolizer (EM) 
and poor metabolizer (PM). The prevalence of PM is different among different 
populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence 
5 of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite 
frequently experience exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active therapeutic moiety, PM show 
no therapeutic response, as demonstrated for the analgesic effect of codeine 
mediated by its CYP2D6-formed metabolite morphine. The other extreme are 

1 0 the so called ultra-rapid metabolizers who do not respond to standard doses. 

Recently, the molecular basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of CARD-3. CARD-4, CARD-5. or CARD-6 
protein, expression of CARD-3 or CARD-4 nucleic acid, or mutation content of 

1 5 CARD-3, CARD-4, CARD-5, or CARD-6 genes in an individual can be 

determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. In addition, pharmacogenetic studies can be used to 
apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to 
the identification of an individual's drug responsiveness phenotype. This 

2 0 knowledge, when applied to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or prophylactic efficiency 
when treating a subject with a CARD-3, CARD-4. CARD-5, or CARD-6 
modulator, such as a modulator identified by one of the exemplary screening 
assays described herein. 

2 5 4. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g.. drugs, compounds) on the 
expression or activity of CARD-3. CARD-4. CARD-5. or CARD-6 (e.g.. the 
ability to modulate aberrant cell proliferation and/or differentiation) can be 
applied not only in basic drug screening, but also in clinical trials. For example. 

3 0 the effectiveness of an agent determined by a screening assay as described herein 
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to increase CARD-3, CARD-4, CARD-5, or CARD-6 gene expression, protein 
levels, or upregulate CARD-3. CARD-4, CARD-5, or CARD-6 activity, can be 
monitored in clinical trails of subjects exhibiting decreased CARD-3, CARD-4, 
CARD-5, or CARD-6 gene expression, protein levels, or downregulated CARD- 
5 3, CARD-4. CARD-5, or CARD-6 activity. Alternatively, the effectiveness of an 
agent determined by a screening assay to decrease CARD-3, CARD-4. CARD-5, 
or CARD-6 gene expression, protein levels, or downregulated CARD-3. CARD- 
4. CARD-5. or CARD-6 activity, can be monitored in clinical trials of subjects 
exhibiting increased CARD-3. CARD-4. CARD-5, or CARD-6 gene expression. 

10 protein levels, or upregulated CARD-3, CARD-4. CARD-5. or CARD-6 activity. 
In such clinical trials, the expression or activity of CARD-3. CARD-4. CARD-5. 
or CARD-6 and, preferably, other genes that have been implicated in. for 
example, a cellular proliferation disorder can be used as a "read out" or markers 
of the immune responsiveness of a particular cell. 

15 For example, and not by way of limitation, genes, including CARD-3, 

CARD-4, CARD-5, or CARD-6, that are modulated in cells by treatment with an 
agent (e.g., compound, drug or small molecule) which modulates CARD-3, 
CARD-4. CARD-5. or CARD-6 activity (e.g.. identified in a screening assay as 
described herein) can be identified. Thus, to study the effect of agents on cellular 

20 proliferation disorders, for example, in a clinical trial, cells can be isolated and 
RNA prepared and analyzed for the levels of expression of CARD-3. CARD-4. 
CARD-5. or CARD-6 and other genes implicated in the disorder. The levels of 
gene expression (i.e., a gene expression pattern) can be quantified by Northern 
blot analysis or RT-PCR. as described herein, or alternatively by measuring the 

2 5 amount of protein produced, by one of the methods as described herein, or by 

measuring the levels of activity of CARD-3. CARD-4, CARD-5, or CARD-6 or 
other genes. In this way. the gene expression pattern can serve as a marker, 
indicative of the physiological response of the cells to the agent. Accordingly, 
this response state may be determined before, and at various points during, 

3 0 treatment of the individual with the agent. 
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In an embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent (e.g., an 
agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate identified by the screening assays described 
5 herein) comprising the steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) detecting the level of expression 
of a CARD-3, CARD-4, CARD-5, or CARD-6 protein, mRNA, or genomic DNA 
in the preadministration sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of expression or activity of the 

10 CARD-3. CARD-4. CARD-5, or CARD-6 protein, mRNA, or genomic DNA in 
the post-administration samples; (v) comparing the level of expression or activity 
of the CARD-3, CARD-4, CARD-5. or CARD-6 protein. mRNA. or genomic 
DNA in the pre-administraiion sample with the CARD-3, CARD-4. CARD-5. or 
CARD-6 protein, mRNA, or genomic DNA in the post administration sample or 

15 samples; and (vi) altering the administration of the agent to the subject 

accordingly. For example, increased administration of the agent may be desirable 
to increase the expression or activity of CARD-3, CARD-4, CARD-5. or CARD- 
6 to higher levels than detected, i.e.. to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may be desirable to decrease 

2 0 expression or activity of CARD-3, CARD-4, CARD-5, or CARD-6 to lower 
levels than detected, i.e.. to decrease the effectiveness of the agent. 
5. Transcriptional Profiling 

The CARD-3, CARD-4. CARD-5. and CARD-6 nucleic acid 
molecules described herein, including small oligonucleotides, can be used in 

2 5 transcriptionally profiling. For example, these nucleic acids can be used to 

examine the expression of CARD-3. CARD-4, CARD-5. and CARD-6 in normal 
tissue or cells and in tissue or cells subject to a disease state, e.g., tissue or cells 
derived from a patient having a disease of interest or cultured cells which model 
or reflect a disease state of interest, e.g., cells of a cultured tumor cell line. By 

3 0 measuring expression of CARD-3, CARD-4. CARD-5, and CARD-6, together or 
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individually, a profile of expression in normal and disease states can be 
developed. This profile can be used diagnostically and to examine the 
effectiveness of a therapeutic regime. 

5 C. Methods of Treatment 

The present invention provides for both prophylactic and therapeutic 
methods of treating a subject at risk of (or susceptible to) a disorder or having a 
disorder associated with aberrant CARD-3. CARD-4. CARD-5. or CARD-6 
expression or activity, examples of which are provided herein. 

10 1 . Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a 
subject, a disease or condition associated with an aberrant CARD-3, CARD-4, 
CARD-5, or CARD-6 expression or activity, by administering to the subject an 
agent which modulates CARD-3, CARD-4. CARD-5, or CARD-6 expression or 

15 at least one CARD-3, CARD-4, CARD-5, or CARD-6 activity. Subjects at risk 
for a disease which is caused or contributed to by aberrant CARD-3, CARD-4, 
CARD-5. or CARD-6 expression or activity can be identified by. for example, 
any or a combination of diagnostic or prognostic assays as described herein. 
Administration of a prophylactic agent can occur prior to the manifestation of 

2 0 symptoms characteristic of the CARD-3, CARD-4. CARD-5, or CARD-6 

aberrancy, such that a disease or disorder is prevented or. alternatively, delayed in 
its progression. Depending on the type of CARD-3. CARD-4, CARD-5, or 
CARD-6 aberrancy, for example, a CARD-3. CARD-4, CARD-5, or CARD-6 
agonist or CARD-3. CARD-4, CARD-5, or CARD-6 antagonist agent can be 

2 5 used for treating the subject. The appropriate agent can be determined based on 

screening assays described herein. Activities of CARD-3, CARD-4, CARD-5, or 
CARD-6 that could be modulated for prophylactic purposes include, but are not 
limited to: 1 ) CARD-3, CARD-4, CARD-5, or CARD-6 gene or protein 
expression, for example, see Example 11 for a description of the mRNA 

3 0 expression pattern of human CARD-4: 2)CARD-3. CARD-4, CARD-5. or 

- 121 - 

BNSDCCID: <WO 0100826A2_I_> 



WO 01/00826 



PCT/US0O/17691 



CARD-6 binding to a target protein, for example, see Examples 7. 8. and 1 2 for a 
description of proteins known to bind to CARD-3 or CARD-4; 3) CARD-4 
regulation of NF-kB as described in Example 9; and 4) CARD-3 and CARD-4 
enhancement of caspase 9 activity as described in Example 1 0. 
5 2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating 
CARD-3. CARD-4, CARD-5, or CARD-6 expression or activity for therapeutic 
purposes. The modulatory method of the invention involves contacting a cell 
with an agent that modulates one or more of the activities of CARD-3, CARD-4. 

1 0 CARD-5, or CARD-6 protein activity associated with the cell. An agent that 
modulates CARD-3, CARD-4. CARD-5. or CARD-6 protein activity can be an 
agent as described herein, such as a nucleic acid or a protein, a naturally- 
occurring cognate ligand of a CARD-3. CARD-4. CARD-5. or CARD-6 protein, 
a peptide, a CARD-3, CARD-4, CARD-5, or CARD-6 peptidomimetic. or other 

15 small molecule. In one embodiment, the agent stimulates one or more of the 
biological activities of CARD-3, CARD-4, CARD-5, or CARD-6 protein. 
Examples of such stimulatory agents include active CARD-3, CARD-4, CARD- 
5, or CARD-6 protein and a nucleic acid molecule encoding CARD-3, CARD-4. 
CARD-5, or CARD-6 that has been introduced into the cell. In another 

2 0 embodiment, the agent inhibits one or more of the biological activities of C ARD- 
3. CARD-4. CARD-5. or CARD-6 protein. Examples of such inhibitory agents 
include antisense CARD-3, CARD-4, CARD-5, or CARD-6 nucleic acid 
molecules and anti-CARD-3, CARD-4, CARD-5, or CARD-6 antibodies. These 
modulatory methods can be performed in vitro (e.g., by culturing the cell with the 

2 5 agent) or, alternatively, in vivo (e.g, by administering the agent to a subject). As 

such, the present invention provides methods of treating an individual afflicted 
with a disease or disorder characterized by aberrant expression or activity of a 
CARD-3. CARD-4, CARD-5. or CARD-6 protein or nucleic acid molecule or a 
disorder related to CARD-3. CARD-4. CARD-5 or CARD-6 expression or 

3 0 activity. In one embodiment, the method involves administering an agent (e.g.. 
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an agent identified by a screening assay described herein), or combination of 
agents that modulates (e.g., upregulates or downregulates) CARD-3, CARD-4. 
CARD-5, or CARD-6 expression or activity. In another embodiment, the method 
involves administering a CARD-3, CARD-4, CARD-5, or CARD-6 protein or 
5 nucleic acid molecule as therapy to compensate for reduced or aberrant CARD-3, 
CARD-4, CARD-5, or CARD-6 expression or activity. Activities of CARD-3, 
CARD-4, CARD-5, or CARD-6 that could be modulated for therapeutic purposes 
include, but are not limited to. 1) CARD-3. CARD-4, CARD-5, or CARD-6 gene 
or protein expression, for example, see Example 11 for a description of the 

1 C mRNA expression pattern of human CARD-4; 2) CARD-3. CARD-4. CARD-5. 
or CARD-6 binding to a target protein, for example, see Examples 7, 8. and 12 
tor a description of proteins known to bind to CARD-3 or CARD-4; 3) CARD-4 
regulation of NF-kB as described in Example 9; and 4) CARD-4 enhancement of 
caspase 9 activity as described in Example 10. 

1 5 Stimulation of CARD-3, CARD-4, CARD-5, or CARD-6 activity is 

desirable in situations in which CARD-3, CARD-4, CARD-5. or CARD-6 is 
abnormally downregulated and/or in which increased CARD-3, CARD-4, 
CARD-5, or CARD-6 activity is likely to have a beneficial effect. Conversely, 
inhibition of CARD-3. CARD-4, CARD-5. or CARD-6 activity is desirable in 

2 0 situations in which CARD-3, CARD-4. CARD-5. or CARD-6 is abnormally 
upregulated, e.g., in myocardial infarction, and/or in which decreased CARD-3, 
CARD-4. CARD-5, or CARD-6 activity is likely to have a beneficial effect. 
Since CARD-4 may play be involved in the processing of cytokines, inhibiting 
the activity or expression CARD-4 may be beneficial in patients that have 

2 5 aberrant inflammation. 

This invention is further illustrated by the following examples which 
should not be construed as limiting. The contents of all references, patents and 
published patent applications cited throughout this application are hereby 
incorporated by reference. 
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EXAMPLES 

Example 1 : Isolation and Characterization of full- length Human 
CARD-3 and CARD-4L/S cDNAs. 

A profile of known CARD domains was used to search databases of 
5 cDN A sequences and partial cDNA sequences using TBLASTN (Washington 
University; version 2.0, BLOSUM62 search matix). This search led to the 
identification of CARD-3. Using CARD-3 to search databases of cDNA 
sequences and partial cDNA sequences, another potential CARD cDNA was 
found. This cDNA sequence was used screen a human umbilical vein endothelial 
1 0 library (HUVE) and a clone containing the partial CARD-4S was identified. The 
human umbilical vein endothelial library was then rescreened using a probe 
designed against the partial CARD-4S sequence and a clone containing the 
CARD-4L sequence was identified. 

1 5 Example 2: Characterization of CARD-3 AND CARD-4L/S Proteins. 

In this example, the predicted amino acid sequences of human CARD- 
3 and C ARD-4L/S proteins were compared to amino acid sequences of known 
proteins and various motifs were identified. For example, the CARD domains of 
CARD-3 and CARD-4 were aligned (Figure 7) with the CARD domains of ARC - 

2 0 CARD (SEQ ID NO:31), cIAPl-CARD (SEQ ID NO:32) and cIAP2-CARD 
(SEQ ID NO:33). In addition, the molecular weight of the human CARD-3 and 
CARD-4L/S proteins were predicted. 

The human CARD-3 cDNA was isolated as described above (Figure 
1 : SEQ ID NO: 1 ) and encodes a 540 amino acid protein (Figure 2: SEQ ID 

2 5 NO:2). CARD-3 also includes one predicted kinase domain (amino acid 1 to 

amino acid 300 of SEQ ID NO:2; SEQ ID NO:4), which is followed by a 
predicted linker domain (amino acid 301 to amino acid 43 1 of SEQ ID NO:2; 
SEQ ID NO:5) and a predicted CARD domain (amino acid 432 to amino acid 
540 of SEQ ID NO:2: SEQ ID NO:6). 

3 0 The human CARD-4L cDNA was isolated as described above (Figure 
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3; SEQ ID NO:7) and has a 2859 nucleotide open reading frame (nucleotides 
245-3103 of SEQ ID NO:7; SEQ ID NO: 9) which encodes a 953 amino acid 
protein (Figure 4; SEQ ID NO:8). CARD-4L protein has a predicted CARD 
domain (amino acids 15-1 14; SEQ ID NO: 10). CARD-4L is also predicted to 
5 have a nucleotide binding domain which extends from about amino acid 198 to 
about amino acid 397 of SEQ ID NO:8; SEQ ID NO:l 1, a predicted Walker Box 
"A", which extends from about amino acid 202 to about amino acid 209 of SEQ 
ID NO:8; SEQ ID NO: 12, a predicted Walker Box "B", which extends from 
about amino acid 280 to about amino acid 284, of SEQ ID NO:8: SEQ ID NO: 13, 

10 a predicted kinase la (P-loop) domain, which extends from about amino acid 197 
to about amino acid 212 of SEQ ID NO: 8; SEQ ID NO:46. a predicted kinase 2 
domain, which extends from about amino acid 273 to about amino acid 288 of 
SEQ ID NO:8; SEQ ID NO:47, a predicted kinase 3a subdomain. which extends 
from about amino acid 327 to about amino acid 338 of SEQ ID NO:8; SEQ ID 

1 5 NO: 14, ten predicted Leucine-rich repeats which extend from about amino acid 
674 to about amino acid 950 of SEQ ID NO: 8. The first Leucine-rich repeat is 
predicted to extend from about amino acid 674 to about amino acid 701 of SEQ 
ID NO:8; SEQ ID NO: 15. The second Leucine-rich repeat is predicted to extend 
from about amino acid 702 to about amino acid 727 of SEQ ID NO:8; SEQ ID 

2 0 NO: 1 6. The third Leucine-rich repeat is predicted to extend from about amino 
acid 728 to about amino acid 754 of SEQ ID NO:8; SEQ ID NO: 17. The fourth 
Leucine-rich repeat is predicted to extend from about amino acid 755 to about 
amino acid 782 of SEQ ID NO:8: SEQ ID NO: 18. The fifth Leucine-rich repeat 
is predicted to extend from about amino acid 783 to about amino acid 810 of SEQ 

2 5 ID NO:8: SEQ ID NO: 1 9. The sixth Leucine-rich repeat is predicted to extend 

from about amino acid 81 1 to about amino acid 838 of SEQ ID NO:8; SEQ ID 
NO:20. The seventh Leucine-rich repeat is predicted to extend from about amino 
acid 839 to about amino acid 866 of SEQ ID NO:8: SEQ ID NO:21. The eighth 
Leucine-rich repeat is predicted to extend from about amino acid 867 to about 

3 0 amino acid 894 of SEQ ID NO:8; SEQ ID NO:22. The ninth Leucine-rich repeat 
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is predicted to extend from about amino acid 895 to about amino acid 922 of 
SEQ ID NO: 8; SEQ ID NO:23 and the tenth Leucine-rich repeat is predicted to 
extend from about amino acid 923 to about amino acid 950 of SEQ ID NO:8; 
SEQ ID NO:24. 

5 The human partial CARD-4S cDNA isolated as described above 

(Figure 5; SEQ ID NO:25) encodes a 490 amino acid protein (Figure 6; SEQ ID 
NO:26). CARD-4S includes one predicted partial CARD domain (amino acids 1- 
74 of SEQ ID NO:26). CARD-4S is also predicted to have a P-Loop which 
extends from about amino acid 1 63 to about amino acid 1 70 of SEQ ID NO:26; 

1 0 SEQ ID NO:29, and a predicted Walker Box "B" which extends form about 

amino acid 241 to about amino acid 245 of SEQ ID NO:26; SEQ ID NO:30. 

A plot showing the predicted structural features of CARD-4L is 
presented in Figure 8. This figure shows the predicted alpha regions (Garnier- 
Robinson and Chou-Fasman), the predicted beta regions (Garnier-Robinson and 
1 5 Chou-Fasman), the predicted turn regions (Garnier-Robinson and Chou-Fasman) 
and the predicted coil regions (Garnier-Robinson and Chou-Fasman). Also 
included in the figure is a hydrophilicity plot (Kyte-Doolittle), the predicted alpha 
and beta-amphatic regions (Eisenberg), the predicted flexible regions (Karplus- 
Schulz). the predicted antigenic index (Jameson- Wolf) and the predicted surface 

2 0 probability plot (Emini). 

A plot showing the predicted sturctural features of CARD-4S is also 
presented in Figure 9. This figure shows the predicted alpha regions (Garnier- 
Robinson and Chou-Fasman), the predicted beta regions (Garnier-Robinson and 
Chou-Fasman), the predicted turn regions (Garnier-Robinson and Chou-Fasman) 
2 5 and the predicted coil regions (Garnier-Robinson and Chou-Fasman). Also 

included in the figure is a hydrophilicity plot (Kyte-Doolittle). the predicted alpha 
and beta-amphatic regions (Eisenberg), the predicted flexible regions (Karplus- 
Schulz), the predicted antigenic index (Jameson-Wolf) and the predicted surface 
probability plot (Emini). 

30 
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The predicted MW of CARD-3 is approximately 61 kDa. The 
predicted MW of CARD-4L is approximately 1 08 kDa. 

Example 3: Preparation of CARD-3 and CARD-4 Proteins 
5 Recombinant CARD-3 and CARD-4 can be produced in a variety of 

expression systems. For example, the CARD-3 and CARD-4 peptides can be 
expressed as a recombinant glutathione-S-transferase (GST) fusion protein in E. 
coli and the fusion protein can be isolated and characterized. Specifically, as 
described above, CARD-3 or CARD-4 can be fused to GST and the fusion 
1 0 protein can be expressed in E. coli strain PEB199. Expression of the GST- 

CARD-3 or GST-CARD-4 fusion protein in PEB199 can be induced with IPTG. 
The recombinant fusion protein can be purified from crude bacterial lysates of the 
induced PEB199 strain by affinity chromatography on glutathione beads. 

15 Example 4: Identification of splice variants of CARD-4. 

The 5' untranslated sequence from CARD-4L was used to search 
databases of cDNA sequences and partial cDNA sequences using BLASTN 
(Washington University; version 2.0, BLOSUM62 search matrix) for additional 
CARD-4 cDNA clones. This search led to the identification of two cDNA 

2 0 clones, clone Z from a human lymph node library and the Y clone from a human 
brain cDNA library. Both clones were sequenced and found to represent 
probable splice variants of CARD-4 that encode truncated CARD-4 proteins, Y 
encoding a 249 amino acid protein and Z encoding a 164 amino acid protein. 
Fig. 10 shows the nucleotide (SEQ ID NO:38) and Fig. 1 1 the predicted amino 

2 5 acid (SEQ ID NO:39) sequences of human CARD-4 Y; Fig. 12 shows the 
nucleotide (SEQ ID NO:40) and Fig. 13 the amino acid (SEQ ID NO:41) 
sequences of human CARD-4Z; and Fig. 14 shows an alignment of the CARD- 
4L. CARD-4Y, and CARD-4Z amino acid sequences generated by the Clustal 
program using a PAM250 residue weight table. 
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Example 5: Identification of murine CARD-4. 

The CARD-4 polypeptide sequence was used to search databases of 
cDNA sequences and partial cDNA sequences using the TBLASTN program 
(version 1.4, BLOSUM62 search matrix, and a word length of 3) for murine 
5 CARD-4 cDNA clones. This search led to the identification of a partial murine 
CARD-4 clone designated murine CARD-4L. The rapid identification of cDNA 
ends procedure (RACE) was applied to the 5' end of the murine CARD-4L clone 
to elucidate the 5' end of the murine CARD-4L cDNA. Fig. 15 shows the murine 
CARD-4L nucleotide sequence (SEQ ID NO:42), Figure 16 shows the murine 
1 0 CARD-4L amino acid sequence (SEQ ID NO:43), and Fig. 1 7 shows an 

alignment of the murine CARD-4L and human CARD-4L amino acid sequences 
generated by the Clustal program using a PAM250 residue weight table. 

Example 6: Identification of the chromosomal location of human CARD-4. 

15 To determine the chromosomal location of the human CARD-4 gene, 

the polymerase chain reaction carried out with human CARD-4-specific primers 
card4t, with the 5' to 3' sequence agaaggtctggtcggcaaa (SEQ ID NO:44), and 
card4k, with the 5' to 3' sequence aagccctgagtggaagca (SEQ ID NO:45). was used 
to screen DNAs from a commercially available somatic cell hybrid panel. This 

2 0 analysis showed that human CARD-4 maps to chromosome 7 close to the SHGC- 
3 1928 genetic marker. 

Example 7: Identification of CARD-3 in a yeast two-hybrid screen for proteins 
that physically interact with the CARD domain of human CARD-4. 

2 5 DNA encoding amino acids 1 - 1 45 of human CARD-4 comprising the 

CARD domain was cloned into a yeast two-hybrid screening vector to create a 
CARD-4, 1-145-GAL4 DNA-binding domain fusion for two-hybrid screening. 
The CARD-4,1-145-GAL4 DNA-binding domain fusion was used to screen 
human mammary gland and human prostate two-hybrid libraries for gene 

3 0 products that could physically associate with CARD-4. 1 -145. Twelve library 
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plasmids expressing CARD4, 1-145 interacting proteins were found to contain the 
CARD-domain containing protein CARD-3 thus establishing a direct or indirect 
physical interaction between CARD-4 and CARD-3. 

In addition, DNA encoding amino acids 435-540 of CARD-3 
5 comprising the CARD domain of CARD-3 (SEQ ID NO:6) was cloned into a 
yeast two-hybrid GAL4 transcriptional activation domain fusion vector to create 
a CARD-3,435-540-GAL4 transcriptional activation domain fusion. To test 
whether the CARD domain of CARD-3 binds CARD-4, 1-145, the CARD-3,435- 
540-GAL4 transcriptional activation domain fusion expression vector and the 

10 CARD-4J-145-GAL4 DNA-binding domain fusion vector were cotransformed 
into a two-hybrid screening Saccharomyces cerevisiae (yeast) strain. The 
resulting cotransformed yeast strain expressed the two reporter genes that 
indicate a physical interaction between the two hybrid proteins in the experiment, 
in this case, the CARD-3,435-540-GAL4 transcriptional activation domain fusion 

1 5 protein and the CARD-4, 1 - 1 45-GAL4 DNA-binding domain fusion protein. This 
experiment established a physical interaction between the CARD domain of 
CARD-3 and the CARD domain of CARD-4. 

Example 8: Identification of hNUDC in a yeast two-hybrid screen for proteins 
2 0 that physically interact with the LRR domain of human CARD-4. 

DNA encoding amino acids 406-953 of human CARD-4L comprising 
the LRR domain was cloned into a yeast two-hybrid screening vector to create a 
CARD-4,406-953-GAL4 DNA-binding domain fusion for two-hybrid screening. 
The CARD-4.406-953-GAL4 DNA-binding domain fusion was used to screen a 
2 5 human mammary gland two-hybrid library for gene products that could 

physically associate with CARD-4,406-953. One library plasmid expressing a 
CARD-4,406-953 interacting protein was found to contain the hNUDC protein, 
the human ortholog of the rat NUDC protein that has been implicated in nuclear 
movement (Morris et al., Curr. Biol. 8:603 [1998], Morris et al.. Exp. Cell Res. 

30 
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238:23 [1998]), thus establishing a physical interaction between CARD-4 and 
hNUDC. 

Example 9: Discovery of regulation by CARD-4 of NF-kB. 
5 The first group of experiments described in this Example were carried 

out to determine if CARD-4 can activate the NF-kB pathway. CARD-4 
regulation of the NF-kB pathway is of interest because the NF-kB pathway is 
involved in many diseases described in (New England Journal of Medicine 
336: 1 066 [ 1 997]) and (American Journal of Cardiology 76: 1 8C [ 1 995]) and other 

1 0 references known to those skilled in the art. Participation of CARD-4 in the NF- 
kB pathway would make CARD-4 an attractive target for drugs that modulate the 
NF-kB pathway for treatment of NF-kB pathway-dependent diseases, conditions, 
and biological processes. 

The first group of experiments showed specific CARD-4-mediated 

1 5 NF-kB pathway induction. 

The second group of experiments described in this Example were 
carried out to determine if CARD-3, the NIK serine/threonine protein kinase (Su 
et al.. EMBO J. 16:1279 [1997]), or the signal transduction protein TRAF6 (Cao 
et al., Nature 383:443 [1996]), proteins known to participate in the induction of 

2 0 NF-kB (McCarthy et al.. J. Biol. Chem. 273:16968 [1998]), are involved in 
transducing the CARD-4-dependent NF-kB pathway induction signal. It was 
found that CARD-3, NIK. and TRAF6 are all involved in transducing the CARD- 
4-mediated NF-kB pathway induction signal. 

In nine transfection experiments. 293T cells coexpressing an NF-kB 

2 5 reporter plasmid and either pCI, pCI-CARD-4L (expressing CARD-4L), pCI- 

CARD-4S (expressing CARD-4S), pCI-APAFL (expressing Apaf-1). pCI- 
AP AFS (expressing an Apaf- 1 variant lacking WD repeats), pCI-CARD- 
4LnoCARD (expressing CARD-4L without a CARD domain), pCI- 
CARD4LnoLRR (expressing CARD-4L without a LRR), pCI- 

3 0 CARD4LCARDonly (expressing CARD-4L CARD domain only), or pCI- 
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CARD4NBSonly (expressing CARD-4L nucleotide binding sequence only) were 
created. 293T cells cells were plated in 6-weII plates (35 mm wells) and 
transfected 2 days later (90% confluency) with 1 p.g of NF-kB luciferase reporter 
plasmid (pNF-icB-Luc, Stratagene), 200 ng of pCMV ii-gal, 600 ng of pCI vector 
5 and 200 ng of indicated expression plasmids using SuperFect transfection reagent 
(Qiagen). For dominant-negative experiments, 2 ng of CARD4 expressing 
plasmid and 800 ng of dominant-negative plasmid were used. Cells were 
harvested 48 h after transfection and luciferase activity in 1000-fold diluted cell 
extracts was determined using the Luciferase Assay System (Promega). In 

1 0 addition, B-galactosidase activities were determined and used to normalize 
transfection efficiency. 

Relative luciferase activity was determined at the end of the 
experiment to assess NF-kB pathway activation by the gene expressed by the 
pCI-based plasmid in each transfected cell line. The cell lines containing pCI. 

1 5 pCI- APAFS , pCI-APAFL, pCI-C ARD-4LnoC ARD, and pCI-C ARD4NBSonly 
had similar baseline levels of luciferase expression but the cell lines containing 
pCI-CARD-4L, pCI-CARD4LnoLRR, and pCI-CARD4LCARDonly had 
luciferase expression about nine fold elevated relative to baseline and the cell line 
containing pCI-CAR_D4S had luciferase expression sixteen fold elevated relative 

2 0 to baseline. This result demonstrates induction by CARD-4S and CARD-4L of 
the NF-kB pathway. This CARD-4 mediated NF-kB pathway induction is 
dependent on the CARD-4 CARD domain because the pCI-CARD-4noCARD 
construct expressing CARD-4 lacking its CARD domain did not induce the 
luciferase reporter gene and pCI-CARD4LCARDonly expressing the CARD-4 

2 5 CARD domain did induce the luciferase reporter gene. Also, the CARD-4 LRR 

domains are not required for NF-kB pathway activation because pCI- 
CARD4LnoLRR expressing a CARD-4 mutant protein lacking LRR domains is 
able to induce the luciferase reporter gene. In addition, the CARD-4 NBS 
domain is not sufficient for NF-kB pathway activation because pCI- 

3 0 CARD4NBSonly expressing CARD-4 NBS domain is not able to induce the 
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luciferase reporter gene. In addition, the induction of the NF-kB pathway by 
CARD-4 is specific, as neither Apaf-expressing construct in this experiment 
induced luciferase activation. 

In five transfection experiments, 293T cells coexpressing an NF-kB 
5 reporter plasmid (NF-icB-luciferase, Stratagene) and pCI-CARD-4L and either, 
no vector, pCI-TRAF6-DN (expressing a dominant negative version of TRAF-6), 
pCI-NIK-DN (expressing a dominant negative version of NIK kinase), pCI- 
CARD3CARDonly (expressing the CARD domain of CARD-3, which acts as a 
dominant negative version of CARD-3). or pCI-Bcl-XL (expressing the anti- 

1 0 apoptotic protein Bcl-XL) were created. TRAF6-DN. NIK-DN, and CARD3- 
CARDonly are dominant negative alleles of the TRAF6. NIK, and CARD3 
genes, respectively. After 48 hours, cells were lysed and the relative luciferase 
activity was determined (Promega Kit) to assess NF-kB pathway activation by 
the genes expressed by the one or two pCI-based plasmids in each transfected cell 

1 5 line. The cell lines containing pCI-CARD-4L only or pCI-CARD-4L and pCI- 
Bcl-XL had relative luciferase reporter gene expression of about 1 8 units. The 
cell lines containing pCI-CARD-4L and pCI-TRAF6-DN, pCI-CARD-4L and 
pCI-NIK-DN, or pCI-CARD-4L and pCI-CARD3CARDonly had relative 
luciferase reporter gene expression of about 4 units. Inhibition of CARD-4L- 

2 0 mediated NF-kB pathway induction by TRAF6-DN, NIK-DN, and CARD- 
3CARDonly is specific as Bcl-XL did not inhibit CARD-4L-mediated NF-kB 
pathway induction. 

These results demonstrate that dominant negative alleles of TRAF6. 
NIK and CARD-3 expressed, respectively, from pCI-TRAF6-DN, pCI-NIK-DN, 

2 5 and pCI-CARD3CARDonly block induction of the NF-kB reporter gene by 

CARD-4L expression (pCI-CARD-4L) and suggest that TRAF6, NIK. and 
CARD-3 act downstream of CARD-4L to transduce the CARD-4L NF-kB 
pathway induction stimulus. 

In an additional experiment, coexpression of CARD-4 and the CARD 

3 0 domain of CARD-3 revealed that the CARD domain of CARD-3 functions as a 
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dominant negative mutant suggesting that CARD-3 is a downstream mediator of 
CARD-4 function. 

Example 10: Discovery of CARD-4 enhancement of caspase-9 activity. 
5 In ten transfection experiments, 293T cells coexpressing a beta 

galactosidase-expressing plasmid (pCMV (3-gal from Stratagene) as a marker for 
viable cells and either pCl, pCI-CARD-3, pCI-APAF, pCl-CARD-4L, pCI- 
CARD-4S, pCI-CARD4LnoLRR, pCI-CARD4NBSonly, pCI- 
CARD4LCARDonly, pCI-CARD-4LnoCARD or pCI-casp9 (expressing caspase- 

10 9) were created. Transfections included 400 ng of pCMV P-gal. 800 ng of 
expression plasmid, and Superfect transfection reagent from Qiagen and were 
carried out according to the manufacturer's directions. After 40-48 hours, cells 
were fixed and stained for beta-galactosidase expression and cell viability was 
determined by counting the number of beta galactosidase positive cells. 

1 5 Expression of pCI, pCI-C ARD-3 , pCI-APAF, pCI-C ARD-4L, pCI-C ARD-4S, 
pCI-CARD4LnoLRR, pCI-CARD4NBSonly, pCI-CARD4LCARDonly, and pCI- 
CARD-4LnoCARD did not result in loss of cell viability. As expected, 
expression of pCl-casp9 in 293T cells resulted in a loss of viability of about 75% 
of the cells in the experiment. 

2 0 It was next tested whether pCI, pCI-C ARD-3, pCI-APAF, pCI- 

CARD-4L, pCI-CARD-4S, pCI-CARD4LnoLRR, pCI-CARD4NBSonly, pCI- 
CARD4LCARDonly, or pCI-CARD-4LnoCARD would regulate caspase 9- 
mediated apoptosis. In nine transfection experiments. 293 T cells coexpressing a 
beta galactosidase-expressing plasmid as a marker for viable cells. pCI-casp9, 

2 5 and either pCI. pCI-CARD-3. pCI-APAF. pCl-CARD-4L, pCI-CARD-4S. pCI- 

CARD4LnoLRR, pCI-CARD4NBSonly, pCI-CARD4LCARDonly. and pCI- 
CARD-4LnoCARD were created. After 40-48 hours, cells were fixed and 
stained for beta-galactosidase expression and cell viability was determined by 
counting the number of beta galactosidase positive cells. Expression of pCL pCI- 

3 0 CARD-4LnoCARD, and pCI-CARD4NBSonly in the caspase 9-expressing 293T 
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cells had no effect on the caspase 9-induced apoptosis. However, pCl-CARD-3, 
pCI-CARD-4L, pCI-CARD-4S, pCI-CARD4LnoLRR, pCI-CARD4LCARDonly 
and. as expected, pCI-APAF enhanced the level of caspase 9-induced apoptosis 
to 20 or less beta galactosidase positive cells per experiment from about 100 beta 
5 glactosidase positive cells per experiment. 

This experiment demonstrated that CARD-4 can enhance caspase 9- 
mediated apoptosis because coexpression of CARD-4L or CARD-4S with 
caspase-9 dramatically increases caspase-9 mediated apoptosis. Furthermore, the 
CARD-4 CARD domain (SEQ ID NO: 10) is necessary and sufficient for CARD- 

10 4-mediated enhancement of caspase-9-potentiated apoptosis because CARD-4L 
lacking its CARD domain (pCI-CARD-4LnoCARD) does not enhance caspase-9- 
mediated apoptosis while the CARD-4 CARD domain expressed alone (pCI- 
CARD4LCARDonly) does induce caspase-9 mediated apoptosis. In addition, the 
LRR present in CARD-4 is not required for CARD-4 enhancement of caspase-9- 

15 mediated apoptosis because expression of a CARD-4 protein lacking the LRR 
(pCI-CARD4LnoLRR) still enhances caspase-9-mediated apoptosis. The CARD- 
4 NBS is not sufficient for CARD-4 enhancement of caspase-9-mediated 
apoptosis because expression of the CARD-4 NBS only (pCI-CARD4NBSonly) 
does not enhance caspase-9 mediated apoptosis. This experiment also 

2 0 demonstrates that CARD-3 can enhance caspase-9-mediated apoptosis. 

As detailed below in Example 1 2, CARD-4 does not appear to interact 
directly with caspase-9, suggesting that potentiation of caspase-9 activity by 
CARD-4 is mediated by activation of downstream pathways. 

2 5 Example 1 1 : Identification and tissue distribution of mRNA species expressed 

by the human CARD-4 gene. 

Northern analysis of mRNAs extracted from adult human tissues 
revealed a 4.6 kilobase mRNA band that was expressed in most tissues examined. 
Highest expression was observed heart, spleen, placenta and lung. CARD-4 was 

3 0 also observed to be expressed in fetal brain, lung, liver and kidney. Cancer cell 
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lines expressing the 4.6 kilobase CARD-4 mRNA include HeLa, K562, Molt4, 
SW480, A549 and melanoma. A larger 6.5 to 7.0 kilobase CARD-4 mRNA was 
expressed in heart, spleen, lung, fetal lung, fetal liver, and in the Molt4 and 
SW480 cell lines. 

5 

Example 12: Physical association of CARD-4 with CARD-3. 

CARD-4-specific PCR primers with the 3' primer encoding the HA 
epitope tag were used to amplify the CARD-4L gene epitope tagged with HA and 
this PCR product was cloned into the mammalian expression vector pCI. CARD- 

1 0 3-specific PCR primers with the 5' primer encoding the MYC epitope tag were 

used to amplify the CARD-3 gene epitope tagged with MYC and this PCR 
product was cloned into the mammalian expression vector pCI. CARD-3-specific 
PCR primers with the 5' primer encoding the MYC epitope tag were used to 
amplify the CARD-3 gene lacking the CARD domain (SEQ ID NO:6) epitope 
1 5 tagged with MYC and this PCR product was cloned into the mammalian 
expression vector pCI. Caspase 9-specific PCR primers with the 3' primer 
encoding the MYC epitope tag were used to amplify the caspase 9 gene epitope 
tagged with MYC and this PCR product was cloned into the mammalian 
expression vector pCI. In three transfection experiments, 293T cells 

2 0 coexpressing pCI-CARD-4LcHA and either pCI-CARD3nMYC, pCI- 

CARD3noCARDnMYC, or pCl-casp9cMYC were created. Cells from each 
transfected line were lysed and an immunoprecipitation procedure was carried out 
on each lysate with an anti-MYC epitope tag antibody to precipitate the CARD- 
4LcHA expressed by each cell line and any physically associated proteins. 
25 lmmunoprecipitated proteins were separated by electrophoresis on denaturing 
polyacrylamide gels, transferred to nylon filters, and probed with an anti-HA 
epitope tag antibody in a Western blotting experiment to determine whether the 
MYC-tagged protein that was coexpressed with the CARD-4LcHA protein had 
coimmunoprecipitated with the CARD-4LcHA protein. In this experiment. 

3 0 CARD-3 was found to coimmunoprecipitate with CARD-4 while CARD-3 
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lacking its CARD domain and caspase-9 did not coimmunoprecipitate with 
CARD-4. This experiment demonstrates that CARD-4 and CARD-3 physically 
associate and that CARD-3 requires its CARD domain to associate with CARD- 
4. In addition, CARD-4 appears to not associate with caspase-9. 

5 

Example 13: CARD-4 Genomic Sequence 

Figure 18 is depicts the 32042 nucleotide genomic sequence of 
CARD-4 (SEQ ID NO:63). This sequence is based the CARD-4 cDNA sequence 
described above and a BAC sequence (DBEST Accession No. AC006027). The 

1 0 CARD-4 cDNA sequence described above was used to correct three errors in the 
BAC sequence, including one error resulting in a frameshift. The CARD-4 
genomic sequence of Figure 18 includes the following introns and exons: exon 1: 
nucleotides 364-685, encoding amino acids 1-67 (start codon at nucleotides 485- 
487); intron 1: nucleotides 686-2094; exon 2: nucleotides 2095-2269, encoding 

15 amino acids 67-126; intron 2: nucleotides 2270-4365; exon 3: nucleotides 366- 
6190, encoding amino acids 126-734; intron 3: nucleotides 6191-9024; exon 4: 
nucleotides 9025-9108, encoding amino acids 734-762: intron 4: nucleotides 
9109-10355; exon 5: nucleotides 10356-10439, encoding amino acids 762-790; 
intron 5: nucleotides 10440-1 1181; exon 6: nucleotides 1182-11265, encoding 

2 0 amino acids 790-818; intron 6: nucleotides 1 1266-19749; exon 7: nucleotides 

19750-19833, encoding amino acids 818-846; intron 7: nucleotides 19834-21324; 
exon 8: nucleotides 21325-21408, encoding amino acids 846-874; intron 8: 
nucleotides 21409-24226; exon 9: nucleotides 24227-24310, amino acids 874- 
903; intron 9: nucleotides 2431 1-27948; exon 10: nucleotides 27949-28032, 

2 5 amino acids 903-930; intron 10: nucleotides 28033-31695; exon 1 1 : nucleotides 

3 1696-32024, encoding amino acids 930-953 (stop codon at nucleotides 31766- 
31768). 

The introns in the CARD-4 genomic sequence contain consensus 
splice donor and acceptor sites (Molecular Cell Biology, Darnell et al.. 

3 0 eds..l 996). The CARD-4 genomic sequence is useful for genetic identification 
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and mapping and identifying mutations, e.g., mutations is splice donor or splice 
acceptor sites. 

Example 14: Isolation and characterization of full-length murine CARD-5 and 
5 human CARD-5 

The amino acid sequence of the CARD domain of RAIDD (amino 
acids 1 to 94) was used to search a proprietary murine cDNA sequence database 
using the BLASTX program with the BLOSUM62 matrix and a protein word 
length of three. This search led to the identification of a murine clone, 
1 0 jtmaa010ht2, present in a coronary artery smooth muscle cell library. This clone 
encodes a protein designated CARD-5. The 761 nucleotide murine CARD-5 
cDNA of SEQ ID NO:60 has a 579 nucleotide open reading frame (SEQ ID 
NO:62) encoding a 193 amino acid protein (SEQ ID NO:61). The cDNA and 
protein sequences of murine CARD-5 are shown in Figure 19. 
1 5 Murine CARD-5 is predicted to be an intracellular protein having a 

molecular weight of 21.4 kDa prior to post-translational modification. 

Figure 20 depicts a hydropathy plot of murine CARD-5. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues are below the dashed horizontal line. The cysteine residues 
2 0 (cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

The murine CARD-5 nucleotide sequence was used to search a 
proprietary database of human cDNA sequences. This search led to the 
identification of a human CARD-5 cDNA clone, jthza027gl ltl, present in a 
2 5 testes library. 

The 740 nucleotide murine CARD-5 cDNA of SEQ ID NO:48 has a 
585 nucleotide open reading frame (SEQ ID NO:50) encoding a 1 95 amino acid 
protein (SEQ ID NO:49). The cDNA and protein sequences of human CARD-5 
are shown in Figure 21. 

30 
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Human CARD-5 is predicted to be an intracellular protein having a 
molecular weight of 2 1 .6 kDa prior to post-translational modification. 

Figure 22 depicts a hydropathy plot of human CARD-5. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
5 hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

Figure 23 depicts an alignment of the cDNA sequences of murine 
(SEQ ID NO:60) and human (SEQ ID NO:48) CARD-5. In this alignment the 
1 0 sequences are 68.2% identical. Figure 24 depicts an alignment of the amino acid 
sequences of murine (SEQ ID NO:61) and human (SEQ ID NO:49) CARD-5. In 
this alignment the sequences are 71.8% identical. 

Both murine and human CARD-5 include a CARD domain. The 
CARD domain of murine CARD-5 extends from amino acid 1 10 to 179 of SEQ 
15 ID NO:61 (SEQ ID NO:57). The CARD domain of human CARD-5 extends 
from amino acid 1 1 1 to 181 of SEQ ID NO:49 (SEQ ID NO:58). Figure 27 
depicts an alignment of the CARD domains of murine CARD-5 (SEQ ID 
NO:57), human CARD-5 (SEQ ID NO:58), and RAIDD (SEQ ID NO:70). 



2 0 Example 1 5 : Isolation and Characterization of full-length rat CARD-6 and 
human CARD-6 

A generalized CARD domain model was used to search a proprietary 
rat cDNA sequence database. This search led to the identification of a rat cDNA 
clone present in a sciatic nerve cDNA library. This clone encodes a protein 

2 5 desigated CARD-6. The 5252 nucleotide rat CARD-6 cDNA of SEQ ID NO:51 

has a 2715 nucleotide open reading frame (SEQ ID NO: 53) encoding a 905 
amino acid protein (SEQ ID NO:52). The cDNA and protein sequences of rat 
CARD-6 are shown in Figure 25. 

Rat CARD-6 is predicted to be an intracellular protein having a 

3 0 molecular weight of 100.2 kDa prior to post-translational modification. 
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Figure 26 depicts a hydropathy plot of rat CARD-6. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
5 lines just below the hydropathy trace. 

Rat CARD-6 contains a CARD domain which extends from amino 
acid 1 to amino acid 108 of SEQ ID NO:52 (SEQ ID NO:59). Rat CARD-6 also 
has a proline-rich c-terminus which extends from amino acid 698 to amino acid 
905 of SEQ ID NO:52 (SEQ ID NO:65). This proline-rich domain includes five 

1 0 putative SH3 binding sites. These binding sites have the sequence PXXP and are 
located at amino acids 710 to 713 (PAHP), 806 to 809 (PLRP). 819 to 822 
(PIPP), 857 to 860 (PPHP), and 881 to 884 (PSQP) of SEQ ID NO:52. 

The rat CARD-6 cDNA sequence described above was used to search 
a proprietary sequence database. This search led to the identification of a clone 

1 5 from a human muscle cell library encoding a carboxy-terminal portion of human 
CARD-6. A probe designed based on the sequence of this clone was used to 
screen a human adrenal gland library. This screening led to the identification of a 
clone encoding an amino-terminal portion of human CARD-6. The clone 
encoding an amino terminal portion of human CARD-6 contains a region 

2 0 encoding a CARD domain. This CARD domain-encoding sequence was used to 
screen a proprietary database. This screening led to the identification of a clone. 
jthAb086d02, present in an adrenal gland library, which encodes full length 
human CARD-6. 

The 4244 nucleotide human CARD-6 cDNA of SEQ ID NO:54 has a 

2 5 3111 nucleotide open reading frame (SEQ ID NO:56) encoding a 1 037 amino 

acid protein (SEQ ID NO:55). The cDNA and protein sequences of human 
CARD-6 are shown in Figure 28. 

N-glycosylation sites are present at amino acids 49-52, 415-418. and 
812-815 of SEQ ID NO:55. Human CARD-6 contains cAMP and cGMP- 

3 0 dependent protein kinase phosphorylation sites at amino acids 151-154 and 429- 
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432 of SEQ ID NO:55. Protein kinase C phosphorylation sites are present at 
amino acids 34-36, 57-59, 135-137, 154-156. 161-163, 298-300, 339-341. 346- 
348, 443-445, 664-666, 693-695, 746-748, 882-884, 905-907 and 951-953 of 
SEQ ID NO:55. Casein kinase II phosphorylation sites are present at amino acids 
5 6-9,28-31,40-43, 112-115, 135-138, 154-157,278-281,321-324,339-342,354- 
357. 642-645, 670-673, and 707-710 of SEQ ID NO:55. Tyrosine kinase 
phosphorylation sites are present at amino acids 37-34 and 163-1 69 of SEQ ID 
NO:55. An ATP/GTP-binding site motif A (P-loop) site is present at amino acids 
775-782 of SEQ ID NO:55. 

1 0 Figure 29 depicts a hydropathy plot of human CARD-6. Relatively 

hydrophobic regions are above the horizontal line, and relatively hydrophilic 
regions are below the horizontal line. Cysteine residues are indicated by short 
vertical lines just below the hydropathy trace. 

Human CARD-6 is predicted to have a molecular weight of 1 16.5 kD 

1 5 before post-translational modification. 

Human CARD-6 includes a CARD domain at amino acids 5-92 of 
SEQ ID NO:55 (SEQ ID NO:64). Figure 30 depicts an alignment of the CARD 
domain domain of human CARD-6 and a consensus CARD domain derived from 
a hidden Markov model (SEQ ID NO:67). 

2 0 Northern blot analysis of rat CARD-6 expression revealed that 

CARD-6 is expressed at a high level in the heart (6.5 kb transcript and a 7 kb 
transcript). This analysis also revealed that human CARD-6 is expressed in the 
brain, spleen, lung, liver, muscle, and kidney. 

2 5 Example 16: CARD-6 increases intracellular signaling. 

The studies described in this Example demonstrate that CARD-6 
expression can increase intracellular signaling. 

In a first study, a vector which expresses rat CARD-6 under the 
control of a CMV promoter was transiently transfected into 293 cells along with 

3 0 pNFkP-Luc (Stratagene Inc.. LaJolla, CA). The pNFic|3-Luc vector is a reporter 
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plasmid in which a luciferase gene is under the control of a promoter which 
includes a TATA box and five NFk(3 binding elements. Cotransfection of the rat 
CARD-6 expression vector increased luciferase expression by pNFicP-Luc 18- 
fold over that observed in the absence of the rat CARD-6 expression vector. This 
5 result indicates that CARD-6 stimulates a signaling pathway involving NF-kP. 

In a second study, a vector expressing CARD-6 under the control of 
the CMV promoter was transiently transfected into 293 cells along with pAP-1- 
Luc (Strategene, Inc.). The pAP-l-Luc vector is a reported plasmid in which a 
luciferase gene is under the control of a promoter which includes a TATA box 

1 0 and seven AP-1 binding sites. Co-transfection of the rat CARD-6 expression 

vector increased luciferase expression by pAP-l-Luc 4-fold over that observed in 
the absence of the rat CARD-6 expression vector. This result indicates that 
CARD-6 stimulates a signaling pathway involving AP-1 . 

Additional studies suggest that CARD-6 can stimulate 

1 5 phosphorylation of CHOP (GADD1 53), possibly by activating the stress 
activated kinase, JNK7p38. 

Example 17: Deposit of Clones. 

A plasmid containing a cDNA encoding human CARD-3 (pXEl 7A) 
2 0 was deposited with the American Type Culture Collection (ATCC), Manasass, 
VA on May 14, 1998, and assigned Accession Number 203037. 

A plasmid containing a cDNA encoding human CARD-4L (pC4Ll ) 
was deposited with the American Type Culture Collection (ATCC), Manasass, 
VA on July 7, 1998, and assigned Accession Number 203035. 

2 5 A plasmid containing a cDNA encoding human CARD-4S (pDB33E) 

was deposited with the American Type Culture Collection (ATCC). Manasass, 
VA on May 14, 1998, and assigned Accession Number 203036. 

A plasmid containing a cDNA encoding murine CARD-5 
(EpMC5) was deposited with the American Type Culture Collection (ATCC). 

3 0 Ivlanasass. VA on June 11.1 999. and assigned Accession Number PTA-2 12. 
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A plasmid containing a cDNA encoding rat CARD-6 (EpRC6) was 



deposited with the American Type Culture Collection (ATCC), Manassas, VA. 
on June 1 1 , 1999, and assigned Accession Number PTA-21 1 . 



5 CARD-5, a clone (EpCH6e) containing a cDNA molecule encoding an amino 
terminal portion of human CARD-6, a clone (EpHC6c) containing a cDNA 
molecule encoding a carboxy terminal portion of human CARD-6, and a clone 
(EpHC6) containing a cDNA molecule encoding human CARD-6 were deposited 
with the American Type Culture Collection (ATCC) Manassas, VA on June 1 1. 

1 0 1 999, as a composite deposit and assigned Accession Number PTA-2 13. To 

distinguish the strains and isolate a strain harboring a particular cDN A clone, one 
can first streak out an aliquot of the mixture to single colonies on nutrient 
medium (e.g., LB plates) supplemented with 100 ug/ml ampicillin, grow single 
colonies, and then extract the plasmid DNA from a selected colony using a 

15 standard minipreparation procedure. Next, one can digest a sample of the DNA 
minipreparation with a combination of the restriction enzymes Sal I and Not I and 
resolve the resultant products on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digestion will liberate DNA fragments as 
follows: 



A clone (EpHC5) containing a cDNA molecule encoding human 



20 



Human CARD-5 (EpHC5) 



0.6 kb and 3.0 kb 



Human CARD-6 amino- 
terminal portion (EpHC6e) 
(amino acids 1-279) 



1 .0 kb and 4.3 kb 



25 



Human CARD-6 carboxy 
terminal portion (EpHC6c) 
(amino acid 93-1037) 



3.8kband3.0kb 



30 



Human CARD-6 (EpHC6) 
(amino acids 1-1037) 



4.2 kb and 3.0 kb 
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Equivalents 

Those skilled in the an will recognize, or be able to ascertain using no 
more than routine experimentation, many equivalents to the specific 
embodiments of the invention described herein. Such equivalents are intended to 
5 be encompassed by the following claims. 

What is claimed is: 
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1 . An isolated nucleic acid molecule selected from the group 
consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence 
which is at least 55% identical to the nucleotide sequence of SEQ ID NO:l, 3, 7, 

5 9, 25, 27, 38, 40, 42, 48, 50, 51, 53, 54, 56, 60, 62. or the cDNA insert of the 

plasmid deposited with the ATCC as any of Accession Numbers 

or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 300 
nucleotides of the nucleotide sequence of SEQ ID NO:l, 3, 7, 9, 25, 27, 38, 40, 

10 42, 48, 50, 51, 53, 54, 56, 60, 62, or the cDNA insert of the plasmid deposited 

with the ATCC as any of Accession Numbers , or a complement 

thereof; 

c) a nucleic acid molecule which encodes a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2, 8, 26, 39, 41, 43, 49, 52, 

15 55, 61, or amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers ; 

d) a nucleic acid molecule which encodes a fragment of a 
polypeptide comprising the amino acid sequence of SEQ ID NO:2, 8, 26. 39. 41, 
43, 49, 52, 55. 61 , or the polypeptide encoded by the cDNA insert of the plasmid 

2 0 deposited with the ATCC as any of Accession Numbers , 

wherein the fragment comprises at least 1 5 contiguous amino acids of SEQ ID 
NO:2, 8, 26, 39, 41, 43, 49, 52, 55, 61, or the polypeptide encoded by the cDNA 
insert of the plasmid deposited with the ATCC as any of Accession Numbers 
; and 

2 5 e) a nucleic acid molecule which encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, 8, 26, 39, 41, 43,49, 52, 55, 61. or the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC as any of Accession 
Numbers , wherein the nucleic acid molecule hybridizes to a 

30 
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nucleic acid molecule comprising SEQ ID NO:l, 3, 7, 9, 25, 27, 38, 40, 42, 48, 
50, 51, 53, 54, 56, 60, 62, or a complement thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, which is 
5 selected from the group consisting of: 

a) a nucleic acid comprising the nucleotide sequence of SEQ ID 
NO: 1 , 3, 7, 9, 25, 27, 38, 40, 42, 48, 50, 5 1 . 53, 54, 56, 60, 62, or the cDNA 
insert of the plasmid deposited with the ATCC as any of Accession Numbers 

, or a complement thereof; and 

10 b) a nucleic acid molecule which encodes a polypeptide 

comprising the amino acid sequence of SEQ ID NO:2, 8, 26, 39, 41. 43, 49. 52, 
55, 61, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers . 

15 3. The nucleic acid molecule of claim 1 further comprising vector 

nucleic acid sequences. 

4. The nucleic acid molecule of claim 1 further comprising 
nucleic acid sequences encoding a heterologous polypeptide. 

20 

5. A host cell which contains the nucleic acid molecule of claim 1 . 

6. The host cell of claim 5 which is a mammalian host cell. 

2 5 7. A non-human mammalian host cell containing the nucleic acid 

molecule of claim 1 . 
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8. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2, 8, 26, 39, 41, 43, 49, 52, 55, or 61, wherein the 
fragment comprises at least 1 5 contiguous amino acids of SEQ ID NO:2, 8, 26, 

5 39. 41,43, 49, 52, 55, or 61; 

b) a naturally occurring allelic variant of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2. 8, 26, 39, 41 . 43, 49. 52. 
55, or 61, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers , 

1 0 wherein the polypeptide is encoded by a nucleic acid molecule which hybridizes 
to a nucleic acid molecule comprising SEQ ID NO:l, 3, 7, 9, 25. 27. 38, 40, 42, 
48. 50, 51, 53, 54, 56, 60, 62, or a complement thereof under stringent conditions: 
and 

c) a polypeptide which is encoded by a nucleic acid molecule 

15 comprising a nucleotide sequence which is at least 65% identical to a nucleic acid 
comprising the nucleotide sequence of SEQ ID NO:l, 3, 7, 9, 25. 27, 38, 40, 42. 
48. 50, 51, 53, 54. 56, 60, 62, or a complement thereof. 

9. The isolated polypeptide of claim 8 comprising the amino acid 
2 0 sequence of SEQ ID NO:2, 8. 26, 39, 41 , 43. 49, 52, 55. or 61 . 

10. The polypeptide of claim 8 further comprising heterologous 
amino acid sequences. 

2 5 1 1 . An antibody which selectively binds to a polypeptide of claim 8. 

12. A method for producing a polypeptide selected from the group 
consisting of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID 

3 0 NO:2, 8. 26. 39. 41, 43, 49, 52. 55, 61, or • he amino acid sequence encoded by 
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the cDNA insert of the plasmid deposited with the ATCC as any of Accession 

Numbers ; 

b) a polypeptide comprising a fragment of the amino acid 
sequence of SEQ ID NO:2, 8, 26, 39, 41, 43, 49, 52, 55, 61, or the amino acid 
5 sequence encoded by the cDNA insert of the plasmid deposited with the ATCC 

as any of Accession Number , wherein the fragment comprises at 

least 15 contiguous amino acids of SEQ IDNO:2, 8, 26, 39. 41, 43, 49, 52, 55, 
61 . or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers ; and 

10 c) a naturally occurring allelic variant of a polypeptide 

comprising the amino acid sequence of SEQ ID NO:2, 8, 26. 39, 41, 43, 49, 52, 
55, 6 1 . or the amino acid sequence encoded by the cDNA insert of the plasmid 

deposited with the ATCC as any of Accession Numbers , 

wherein the polypeptide is encoded by a nucleic acid molecule which hybridizes 

15 to a nucleic acid molecule comprising SEQ ID NO:l, 3, 7, 9, 25, 27, 38, 40, 42, 
48. 50, 5 1 , 53, 54, 56, 60, 62, or a complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under conditions in 
which the nucleic acid molecule is expressed. 

2 0 1 3. A method for detecting the presence of a polypeptide of claim 

8 in a sample, comprising: 

a) contacting the sample with a compound which selectively 
binds to a polypeptide of claim 8; and 

b) determining whether the compound binds to the polypeptide in 

2 5 the sample. 

14. The method of claim 13, wherein the compound which binds 
to the polypeptide is an antibody. 
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1 5. A kit comprising a compound which selectively binds to a 
polypeptide of claim 8 and instructions for use. 

16. A method for detecting the presence of a nucleic acid molecule 
5 of claim 1 in a sample, comprising the steps of: 

a) contacting the sample with a nucleic acid probe or primer 
which selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to 
a nucleic acid molecule in the sample. 

10 

1 7. The method of claim 1 6. wherein the sample comprises 
mRN A molecules and is contacted with a nucleic acid probe. 

18. A kit comprising a compound which selectively hybridizes to a 
15 nucleic acid molecule of claim 1 and instructions for use. 

1 9. A method for identifying a compound which binds to a 
polypeptide of claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a polypeptide of 
2 0 claim 8 with a test compound; and 

b) determining whether the polypeptide binds to the test 

compound. 

20. The method of claim 19, wherein the binding of the test 

2 5 compound to the polypeptide is detected by a method selected from the group 
consisting of: 

a) detection of binding by direct detecting of test 
compound/polypeptide binding; 

b) detection of binding using a competition binding assay; 
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c) detection of binding using an assay for CARD-3, CARD-4, 
CARD-5, or CARD-6-mediated signal transduction. 



21 . A method for modulating the activity of a polypeptide of claim 
5 8 comprising contacting a polypeptide or a cell expressing a polypeptide of claim 
8 with a compound which binds to the polypeptide in a sufficient concentration to 
modulate the activity of the polypeptide. 



22. A method for identifying a compound which modulates the 
1 0 activity of a polypeptide of claim 8, comprising: 

a) contacting a polypeptide of claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of 
the polypeptide to thereby identify a compound which modulates the activity of 
the polypeptide. 

15 
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-:CCC — -j i GACCTAGTG - ~ GCGGGGCrwiAA<-wGGGTw - - GCCGGGCTCGCi 1 CGTGCAGGGGCGTA - " 
TTGGGCGCCTGAGCGCGGCGTGGGAG\.C - - -jGGAGCCGCCGCAGCAGGGGGCACACCCGGAACCG 
_ — — — — £CCGGGACCATGAACGGGGAGGCCATCTGCAGCGCCCTGCCCACCAT"TC C TTACCA 
-AAAC - _ jwCGACCTGCGCTACCTGAGCv_G^GGCGCCTCTGGCACT j±GTCGTCCCCCCGCCACG 

- AGACTTwCGCGTCCAGGTGGCCGTGAAGCACCTGCACATCCACAt. x wCGCTGCTCGACAGTGAA 
.-.•-jAAA<jGATGTC . lAAGAGAAGCTGAAA j. _ _ _ ACACAAAGC1 AGAi ± — AGTTACATTTTTCCAAT 

- TTGGGAAi _ . GCAATGAGCC IGAAx ... -GGGAATAGi .ACiGAATACATGC ZAAATGGAT.CAT 

- AAATGAACT~CrACATAGGAAAACTGAATATC_-GATG j. j.GwI'.GGCCATTGAGAt zZC TATC 
TTGCATGAAAi TGCCw. IGGTGTAAATTACCTGCACAATATGACTCwX'CCx * -ACTTrCATCATGA 
CTTTGAAGACTCAGAATATCTrrATTGGACAATGAATTTCATGTTAAGAx ^ jCAGATTTTGGTTTAT 
TAAAGTGGCGCATGATGTCCCTCTCACAGTCACGAAGTAGCAAATu L GCACCAGAAGGAGGGACA 
ATTlATCTATATGCCACCTGAAAACTATGAACCTGGACAAAAATCAAGGGCCAGTATCAAGCACGA 
TATATATAGCTATGCAGTTATCACATGGGAAGTGTTATCCAGAAAACAGCCrTTTGAAGATGTCA 
ZCAATCCTTTGCAGATAATGTATAGTGTGTCACAAGGACATCGACCT jTTATTAATGAAGAAAGT 
TTGCCATATGATATACCTCACCGAGCACGTATGATCTCrCTAATAGAAAGTGGATGGGCACAAAA 
~CCAGATGAAAG^CCl\TCTTTCTrrAAAATGTTTAATAGAACTTGAACCAGTT- GAGAACATTTG 
AAGAGATAAC T . GAAGCTGTTATTCAGCTAAACiAAAACAAAGTTACAGAGTGTTTCAAGT 

i.Grt\j^jAATCAXGTGGA iC«. GT GAGGT — — n.* v^AAAA j 
_ G GT G G GAG CT G GT GAAGAGAATGA^ l"i J I .ATCTAGAAAAGC* — AAGA CT j '1' 'I'Aj. * . * ATGAAG 
GTGCATCACTGTCCTGGAAATCACAGTTGGGATAGCACCAa * . — xGGATc I CAAAGGGn^XGGAi i 
TTGTGATCACAAGACCATTCCATGCTCTTCAGCAATAATAAATCCACTCT 

CIAGAACGTCTGCAGCCTGGTATAGCCZAGCAGTGGATCCAGAGCAAAAGGGAAGACATTGTGAAC 
CAAATGACAGAAGCCTGCCTTAACCAGTCGCTAGATGCCCTTCTGTCCAGGGACT^ 
AGAGGACTATGAA C 1 . Ui *A GTACCAAGCCTACAAGGACCrCAAAAGTCAGACAATTACTAGACA 
CTACTGACATCCAAGGAGAAGAATTTGCCAAAGTTATAGTACAAAAA1T 
ATGGGTCT^-^GCCTTTACCCGGAAATACTTGTGGTTTCTAGATC^^ 

AAATAAAAGCATGTAAGTGACTG' l"! . ' J T CIAAGAAGAAATGTGTTTCATAAAAGGATATTTATAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO:l) 
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7GGAAGAGAC AGARCAA77C CAGAAW7AAA 77G3AA77GA 
.-.GATTTXA.CC AATGT7G777 7AAAA7A777, 7AAC77CAAA GAATGA7GCC AGAACTTtfAA 
AAGGGS.C7GC GCAGAG7AGC AGGGGCCC7G 3AGGGCGCGG CCTGAATCCT GATTGCCC77 
Z7GC7GAGAG GACACACGCA GCTGAAGATG AATT7GGGAA AAGTAGCCGC 77CCTA C. " 
AAC7A7GGAA GAGCAGGGCC ACAGTGAGAT GGAAAXAAXC CCATCAGAGT C7CACCC7GA 
CATTCAATTA C7GAAAAGCA ATCGGGAACT 777GGTCACT CACATCCSCA ATACTCAGTG 
7C7GG7GGAC AACXTGC7GA AGAA7GACTA C77C7CGGCC GAAGATGCGG AGATTGTGTG 
TGCC7GCGCC ACCCAGCC73 ACAAGG7CCG CAAAATTCTG GACCTGGTAC AGAGCAAGGG 
CGAGGAGG7G TCCGAGTTGT 7CC7C7AC7T GC7CGAGCAA CTCGCAGATG CCTACGTGGA 
CC7CAGGCC7 7GGC7GC7GG AGA7CGGC77 C7CCGC77CC CTGCTCACTC AGAGCAAAGT 
-G7GG7CAAC ACTGACCGAG 7GAGCAGG7A TACCCAGCAG CrGCGACACC A7C7GGGCCG 
7GAC77ZAAG 77CG7GC7G7 GC7A7GCCCA GAAGGAGGAG C7GC7GC7GG AGGAGA7C7A 
CATGGACACC ATCATGGAGC 7GG77GGC77 7AGCAA7GAG AGCC7GGGCA GCCTGAACAG 
CC7GGCC7GC C7CC7GGACC ACACCAC GGG CAXCC7GAAX GAGCAGGG7G AGACCAXC77 
CATCC7GGGT GATGC7GGGG 7GGGCAAGTC CATGCTGC7A CAGCGGC7GC AGAGCCTG7G 
GGCCACGGGC CGGC7AGACG CAGGGGTCAA ATTC7TCT7C CACTTTCGCT GCCGCATG7T 
CAGC7GC77C AAGGAAAG7G ACAGGC7G7G 7C7GCAGGAC C7GCTCT7CA AGCACTAC7G 
C7ACCGAGAG CGGGACCCCG AGGAGG7G77 7GCC77GC7G C7GCGC77CC C7CACG7GGC 
CC7C77GACC 77CGA7GGCC 7GGACGAGCT GCAC7CGGAC 77GGACC7GA GCCGCG7; 



7GACAGC7C7 7GCCCG7GGG AGCCTGCCCA CCGCCTGG7C 77GCTGGCCA ACC7GC7CAG 
T 3GGAAGC7G C7CAAGGGGG C7AGCAAGC7 GC7CACAGCC C3CACAGGCA 7CGAGG7CCG 
GCGCCAG77C C7GCGGAAGA . AGGTGC77CT CCGGGGC77C 7CCCCCAGCC ACCTGCGCGC 
C7A7GCCAGG AGGATG77CC CCGAGCGGGC CC7GCAGGAC CGCCTGC7GA GCCAGCTGGA 
SGCCAACCCC AACCTCT3CA GCC7G7GC77 7GTGCCCC7C 77CTGC7GGA 7GATCT7CCG 
G7GC777CAG CACTTCCGTG C7GCC7TTGA AGGC7CACCA CAGC7GCCCG AC7GCACGA7 
GACCG7GACA GATGTC77CC 7GC7GGTCAC 7GAGG7CCAT C7GAACAGGA 7GCAGCCCAG 
CAGCC7GGTG CAGCGGAACA CACGCAGCCC AGTGGAGACC CTCCACGCCG GCCGGGACAC 
TC7G7GC7CG C7GGGGCAGG 7GGCCCACCG GGGCA7GGAG AAGAGCC7CT 77G7C77CAC 
CGAGGAGGAG G7GCAGGCC7 CCGGGCTGCA GGAGAGAGAC ATGCAGC7GG GCTTCC7GCG 
GGC777GCCG GAGCTGGGCC C73GGGG7GA CCAGCAGXCC 7ATGAG7TTT 77GACC7CAC 
CC7CCAGGCC 77C777ACAG CC77C77CG7 GGTGC7GGAC GACAGGG7GG GCACTCAGGA 
3C73G7GAGG 77C777GAGG AG7GGA7GCG CGC7GCGGGG GCAGCGACCA CG7CC7GC7A 
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4/5B 

-rcrrcccrrc G7CCCG77CC .\gtgc~~ca gggcagtggt ccggcgcggg aagacc7C7T 

lAAGAACAAG GATCAC7TCC AGTTCACCAA CrTCTTCCTG 7GCGGGC7G7 7GTCCAAAGC 
IAAACAGAAA C7CC7GCGGC A7CTGG7GCC CGCGGCAGCC C7GAGGAGAA AGCGCAAGGC 
;~37GGGCA CACCTGTTTT CCASCC7GCG GGGC7ACC7G AAGAGCCTGC CCCGCG7TCA 
I-G7CGAAAGC 7TGAACCAGG TGCAGGCC-.7 GCCCACG77C A7CTGGA7GC 73CGCTGCA7 
:7ACGAGACA CAGAGCCAGA AGG7GGGGCA GC7GGCGGCC AGGGGCATCT GCGCCAAC7A 
ZCTCAAGC7G ACCTAC7GCA ACGCC7GC7C 3GCCGAC7GC AGCGCCC7CT CCTTCG7CC7 
3CA7CAC77C CCCAAGCGGC 7GGCCCTAGA CC7AGACAAC AACAA7C7CA ACGAC7ACGG 
CG7GCGGGAG C7GCAGCCC7 GCT7CAGCCG CC7GAC7G77 C7CAGAC7CA GCGTAAACCA 
GATCAC7GAC GG7GGGGTAA AGGTGC7AAG CGAAGAGC7G ACCAAATACA AAATTG7GAC 
C7A777GGG7 T7ATACAACA ACCAGA7CAC CGA7G7CGGA GCCAGG7ACG 7CACCAAAA7 
CZTGGATGAA T3CAAAGGCC 7CACGCA7C7 TAAAC7GGGA AAAAACAAAA 7AACAAG7GA 
AGGAGGGAAG 7A7CTCGCCC 7GGC7G7GAA GAACAGCAAA 7CAA7C7CTG AGG7TGGGA7 
G7GGGGCAA7 CAAGT7GGGG A7GAAGGAGC AAAAGCCTTC GCAGAGGCTC 7GCGGAACCA 
CCCCAGC77G ACCACCC7GA G7CT7GCG7C CAACGGCA7C 7CCACAGAAG GAGGAAAGAG 
CCTTGCGAGG GCCCTGCAGC AGAACACG7C TC7AGAAATA C7GTGGC7GA CCCAAAATGA 
ACTCAACGA7 GAAGTGGCAG AGAG77TGGC AGAAA7G77G AAAG7CAACC AGACGTTAAA 
GCAT77A7GG C77ATCCAGA ATCAGA7CAC AGCTAAGGGG ACTGCCCAGC 7GGCAGATGC 
G77ACAGAGC AACACTGGCA 7AACAGAGA7 T7GCCTAAA7 GGAAACC7GA TAAAACCAGA 
jGAGGCCAAA G7CTATGAAG A7GAGAAGCG GATTATC7GT 77CTGAGAGG A7GCTT7CC7 
-77CA7GGGG ..lll GCCCT GGAGCC7GAG CAGCAAA7GC CAC7CT3GGC AGTC7777G7 
27CAG7G7C7 7AAAGGGGCC 7GCGCAGGCG GGAC7A7CAG GAG7CCACTG CC7YCA7GA7 
GCAAGCCAGC 77CCTG7GCA GAAGG7CTGG 7CGGCAAACT CCC7AAGTAC CCGCTACAA7 
7C7GCAGAAA AAGAATGTG7 C7TGCGAGC7 G7TGTAGTTA CAGTAAA7AC ACTGTGAAGA 
GAAAAAAAAA ACGGACGCGT GG (SEQ ID HO: 7) 



FIG- 3 (page 2 of 2) 



BNSDOCtO'. <WO 0100826A2J_> 



WO 01/00826 



PCT/USOO/17691 



meec ghs ehe 1 1 ? s eshrki cili-csnrellvtki rntq clvdnllxjidyfsasdaezvcacrtqp 
z. xvrf.i ltjlvq 3 kgeev3 z? fl^lc cij^ayvdijipwllei gr s p s ll7qs kvvvntl pvsryt 
^clf-nhlgrdsxcfvlcyaqkeellleeiy!^^ 

etifzlgdagvgksmllgrlqslwatgriz:agvkfffhfrc^fscfkesdrlclqdllfkhycy 
?eri* peevfafllrfphvalftfe'gl:: slhsdldls rvpds3 cpwepahplvllanllsgkllxg 

ASKlirARTGIEVPRQFIJlI^VLIJlGFS.^ 

?LF~7I I FRCFQHFRAAFEGSFQLPDCTMTLTZ-VF LLVTEVHLNRMQP S SLVQRNTRS PVETLHA 
GRD7LC3LGQVAHRGMEKSL?*vTTQEE T /CASGLQERDMQLGFIJUU J ?ELG?GG~CQSYEFFHLTL 
;AFF7AFFLVI^DRVG7CE:JJIF?QE>JMPFAGAATT3C/F?F 

C; FTNLFLCGLL5 KAKQKLIJIHLVPAAAI^JIXRKALWAHLFS 3LRGYLKSLPRVQVES FNQVQAMP 
TF I WMLRC - YSTQSQ KVGQLAARG I CANYLI-CLTYCNACSADCS ALS FVLKHFP KRLALDLDNNNL 
NDYGVRELQ PCFSRLTVLRLS VNQ I TTGGVKVL5EELTKYKI VTYLGLYNNQ I TTJVGARYVTKIL 
D E C KG LTHLKLGKNKI TS EGG KYLALAVKNS K3 1 3 EVGMWGNQVGD EGAKAF AEALKUH P S LTTL 
3 LAS NG I S TEGGKSLARALQ C.NTS LEI LWLTQ^LNDEVAESIAEMLKVNQTI*KHLWLI QNQ I TA 
KGTAQLADALQSNTGITEI CLNGNL I KPEEAXVYEDEKRI I CF ( SE q ^ N0: 8) 
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********'********* rtww^^w,^,, _ — ^ ^ * r ***,*x *m^**\ ***——***************** — - ** - m-,,^—^- , ■ ■ - M i , ■ 

..-iLuuji' — _^v_i.vjAACipiri.a.>-r.-».\_i^-i.>_j. - >_ ~ uuuu^cAAGAi GCGGAGA77 G7G 7 

37GCCTGCCCCACCCAGCC7GACAAGG7CCGCAAAA77C7GGACC7GG7ACAGAGCAAG 

3GCGAGGAGG7 G7G CGAG7T777CG7C7ACTrGC7CCAGCAAC7CGCAGA7GCC7ACG7 

--GACCTCAGGCCTTGGCTGCTGGAGATCGGCTTCTCCCCTTCCGTGCTCACTCAGAGCA 

AAG7C37GG7CAACACTGACCCAGTGAGCAGGTA7ACCCAGCAGCTGCGACACCA7C7G 

3GCCG7GAC7CCAAG77CG7GC7G7GC7ATGCCCAGAAGGAGGAGG7GC7GG7GGAGGA 

:-A7C7ACA7GGACACCA7CA7GGAGC7GG77GGC7TCAGCAA7C-AGAGCC7GGGCAGCC 

rGAACAGCC7GGCCTGCC7CC7GGACCACACCACCGGCA7CC7CAA7GAGCAGGG7GAG 

ACCA7C7TCA7CC7GGG7GA7GC7GGGG7GGGCAAG7CCATGC7GC7ACAGCGGC7GCA 

Z-AGCCTCTGGGCCACGGGCCGGCTAGACGCAGGGG7CAAATTC7TC7TCCAC7T7CGC7 

3CCGCA7G77CAGC7GC77CAAGGAAAG7GACAGGC7G7G7C7GCAGGACC7GC7C77C 

AAGCAC7AC7GC7ACCCAGAGCGGGACCCCGAGGAGG7G777GCC77CC7GC7GCGC77 

CCCCCACG7GGCCG7C77CACC77CGA7GGCC7GGACGAGC7GCAC7CGGAC7TGGACC 

TGAGGCGGG7GCCTGACAGC7GC7GCCCC7GGGAGCC7GCCCACCCCC7GG7C77GC7G 

GCCAACC7GC7CAG7GGGAAGC7GC7GAAGGGGGC7AGCAAGC7GC7CACAGCCCGCAC 

AGGC\TCGAGG7CCCGCGCCAG77GG7GCGGAAGAAGG7GC7TCTCCGGGGC77C7CCC 

:CAGCCACC7GCGCGCC7ATGCCAGGAGGA7G77CCCCGAGCGGGCCC7GCAGGACCGC 

:7GC7GAGCCAGC7GGAGGCGAACCCCAACC7C7GCAGCC7G7GC7CTG7GCCCC7C77 

-7GC7GGA7CA7C7TCCGG7GC77GCAGCAC7TCCG7GC7GCC7TTGAAGGC7CACCAC 

AGC7GCCCGAC7GCACGA7GACCCTGACAGA7G7C7TCC7CC7GG7CAC7GAGG7GCA7 

C7GAACAGGATGCAGCCCAGCAGCG7GG7GCAGCGGAACACACGCAGCCCAG7GGAGAC 

CC7CCACGCCGGCCGGGACAC7C7G7GC7CGC7GGGGCAGG7GGCCCACCGGGGCA7GG 

AGAAGAGCC7C7T7G7C77CACCCAGGAGGAGG7GCAGGCC7CCGGGC7GCAGGAGAGA 

GACA7GCAGC7GGGC77CC7GCGGGC777GCCGGAGC7GGGGCCCGGGGG7GACCAGCA 

G7CC7ATGAGTTTTTCCACC7CAGCC7CC7CACC7GTAAAAC7GGGATCCCAG7A7AGA 

C77TGGAAA7CAG7AGACACCA7ATGC7TCAAAAAACAGGGGC7ATTAAAATGACA7CA 

GGAGCCAGAAAG7CTCA7GGC7G7GC777C7C7TGAAG777ATACAACAACCAGA7CAC 

CGA7G7CGGAGCCAGACTGGGAAAAAACAAAATAACAAG7GAAGGAGGGAAG7ATCTCG 

C CC 7 GGC7G7GAAGAACAGCAAA7CAA7C7 C7GAGG77GGGA7G7GGGGCAA7CAAG7T 

GGGGA7GAAGGAGCAAAAGCC77CGCAGAGGC7C7GCGGAACCACCCCAGC77GACCAC 

CC7GAG7C77GCG7CCAACGGCA7C7CCACAGAAGGAGGAAAGAGCCT7GCGAGGGCCC 

TGCAGCAGAACACG7C7C7AGAAA7ACTG7GGC7GACCCAAAA7GAAC7CAACGA7GAA 

:-7GGCAGAGAG77TGGCAGAAA7G77GAAAG7GAACC\GACG77AAAGCA77TA7GGC7 

— ^ ********\ ***\ j, rj****y ***\ ^***i r*T\ ^T*' ' " ■ V "' 1'**^ < n » y ^^^ m iini « * «««««^m^^««« ********** 

3AC7ATCAGGAG7CCAC7GCC7CCA7GATGCAAGCCAGC77CC7GTGCAGAAGG7C7GG 

rCGGCAAACTCCCTAAG7ACCCGCTACAA77CTGCAGAAAAAGAATG7GTC77GCGAGC 

7G77G7AG7TACAGTAAA7ACAC7G7GAAGAGAC7T7A77GCCTATTA7AA7TA7T77T 

A7C7GAAGC7AGAGGAATAAAGC7G7GAGCAAACAGAGGAGGCCAGCC7CACC7CA77C 

CAACACC7GCCATAGGGACCAACGGGAGCGAG77GG7CACCGC7CT7TTCA77GAAGAG 

T7GAGGA7GTGGCACAAA G7TG G7GCCAAGC77C7TGAATAAAACG7G777GA7GGA77 

AG7ATTA7ACCTGAAA7A7777C7TCCT7C7CAGCACTT7CCCA7G7ATTGA7AC7GG7 

CGCAC7TCACAGCTGGAGACACCGGAGTA7G7GCAGTG7GGGA77TGACTCC7CCAAGG 

T7T7G7GGAAAG7TAA7G7CAAGGAAAGGA7GCACCACGGGC7777AA7777AA7CC7G 

GAG7C7CACTGTCTGC7GGCAAAGA7AGAGAA7GCCC7CAGCTCTTAGC7GG7C7AAGA 

A7GACGA7GCCTTCAAAA7GC7GC77CCAC7CAGGGC7TC7CC7C7GC7AGGC7ACCC7 

CC7C7AGAAGGC7GAG7ACCA7GGGC7ACAG7G7C7GGCC77GGGAAGAAG7GAT7C7G 

7CCCTCCAAAGAAA7AGGGCATGGC77GCCCC7G7GGCCC7GGCA7CCAAA7GGC7GC7 

T77G7C7CCC7TACC7CG7GAAGAGGGGAAG7C7C7TCCTGCC7CCCAAGCAGC7GAAG 

3G7GAC7AAACGGGCGCCAAGAC7CAGGGGA7CGGC7GGGAAC7GGGCCAGCAGAGCA7 

377GGACACCCCCCACCA7GG7GGGC77G7GG7GGC7GC7CCA7GAGGG7GGGGG7GA7 

AC7AC7AGA7CAC77GTCC7C77GCCAGC7CA77TG77AA7AAAATACTGAAAACACAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA (SEQ ID NO:25) 
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CCCGCGTCCGCGTCCCCGGACCATGGCGC7CTCCGGGCTCTTCTCTAGCTCTCAGCGGCT 
GCGAAGTC7GTNAACCTGGTGGCCAAGTGATTGTAAGTCAGGAGACTTTCCTTCGGTTTC 
TGCC777GA7GGCAAGAGG7GGAGA77G7GGCGGCGATTAOVGAAAACA7C7GGGAAGAC 
AAGTTGCTGTTTTTATGGGAATCGCAGGCTTGGAAGAGAC^GAAGC^TTCCAGAAATAA 
ATTGGAAATTGAAGATTTAAACAATGTTGTTTTAAAATATTCTAACTTCAAAGAATGATG 
CCAGAAACTTAAAAAGGGGCTGCGCAGAGTAGCAGGGGCCCTGGAGGGCGCGGCCTGAAT 
CCTGATTGCCCTTCTGCTGAGAGGACACACGCAGCTGAAGATGAATTTGGGAAAAGTAGC 
CGCTTGCTACTTTAACTATGGAAGAGCAGGGCCACAGTGAGATGGAAATAATCCCATCAG 
AGTCTCACCCCCACZATTCAATTACTGAAAAGCAATCGGGAACTTCTGGTCACTCACATCC 
GCAATACTCAGTGTCTGGTGGACAACTTGCTGAAGAATGACTACTTCTCGGCCGAAGATG 
CGGAGA77G7G7G7GCC7GCCCCACCCAGCC7GACAAGG7CCGCAAAA77C7GGACC7GG 
TACAGAGCAAGGGCGAGGAGGTGTCGGAGTTCTTCCTCTACTTGCTCCAGCAACTCGCAG 
ATGCCTACGTGGACCTCAGGCCTTGGCTC-CTGGAGATCGGCTTCTCCCCTTCCCTGCTCA 
CTCAGAGCAAAGTCGTGGTCAAC^CTGACCCAGTGAGGAGGTATACCCAGCAGCTGCGAC 
ACCATCTGGGCCGTGACTCCAAGTTCGTGCTGTGCTATGCCCAGAAGGAGGAGCTGCTGC 
TGGAGGAGATCTACATGGACACCATCATGGAGCTGGTTGGCTTCAGCAATGAGAGCCTGG 
GCAGCCTGAACAGCCTGGCCTGCCTCCTGGACCACACCACCGGCATCCTCAATGAGCAGG 
CTGCTTCAAGGAAAGTGACAGGCTG7GTCTGCAGGACCTGCTCTTCAAGCACTACTGCTA 
CCCAGAGCGGGACCCCGAGGAGG7GTTTGCCTTCCTGCTGCGCTTCCCCCACGTGGCCCT 
CTTCACCTTCGATGGCCTGGACGAGCTGCACTCGGACTTGGACCTGAGCCGCGTGCCTGA 
CAGCTCCTGCCCCTGGGAGCCTC-CGCACCCCC7GGTCTTGC7GGCCAACC7GCTCAGTGG 
GAAGC7GC7CAAGGGGGC7AGCAAGC7GC7CACAGCCCGCACAGGCA7CGAGG7CCCGCG 
CCAG77CC7GCGGAAGAAGG7GC77C7CCGGGGC77C7CCCCCAGCCACC7GCGCGCC7A 
TGCCAGGAGGA7G77CCCCGAGCGGGCCC7GCAGGACCGCC7GC7GAGCCAGC7GGAGGC 
CAACCCCAACCTC7GCAGCC7G7GC7C7G7GCCCC7C77C7GC7GGATCA7C77CCGG7G 
C7TCCAGCACTTCCG7GC7GCC777GAAGGC7CACCACAGC7GCCCGAC7GCACGA7GAC 
CC7GACAGATGTC77CC7CC7GG7CAC7GAGG7CCA7C7GAACAGGA7GCAGCCCAGCAG 
CC7GG7GCAGCGGAACACACGCAGCCCAG7GGAGACCC7CCACGCCGGCCGGGACAC7C7 
G7GC7CGC7GGGGCAGG7GGCCCACCGGGGCA7GGAGAAGAGCC7C777G7CT7CACCCA 
GGAGGAGG7GCAGGCC7CCGGGC7GCAGGAGAGAGACA7GCAGC7GGGC7TCC7GCGGGC 
TTTGCCGGAGC7GGGCCCC3GGGG7GACCAGCAG7CC7A7GAG77777CCACC7CACCC7 

FIG. 10 (Page 1 of 3} 



BNSDOCIO: <WO 0100826A2J_> 



WO 01/00826 



PCT/US00/17691 




CCAGGCC77CT77ACAGCC77C77CC7CG7GC7GGACGACAGGG7GGGCAC7CAGGAGCT 
GCTCAGGTTCTTCCAGGAGTGGA7GCCCCCTGCGGGGGCAGCGACCACGTCCTGCTATCC 
TGCC77CC7CCCG77CCAG7GCC7GCAGGGCAG7GG7CCGGCGCGGGAAGACC7C77CAA 
GAACAAGGATCAC77CCAG77CACCAACC7C77CCTG7GCGGGC7GTTGKCCAAAGCCAA 
ACAGAAAC7CC7GCGGCA7C7GG7GCCCGCGGCAGCCC7GAGGAGAAAGCGCAAGGCCC7 
GTGGGCACACCTGTTTTCCAGCCTGCGGGGCTACCTGAAGAGCCTGCCCCGCG7TCAGGT 
CGAAAGCTTCAACCAGGTGCAGGCCATGCCCACGTTCATCTGGATGCTGCGCTGCATCTA 
CGAGACACAGAGCCAGAAGG7GGGGCAGC7GGCGGCCAGGGGCA7C7GCGCCAACTACCT 
CAAGCTGACCTACTGCAACGCCTGCTCGGCCGACTGCAGCGCCCTCTCCTTCGTCCTGCA 
7CAC77CCCCAAGCGGC7GGCCC7AGACC7AGACAACAACAA7C7CAACGAC7ACGGCGT 
GCGGGAGCTGCAGCCC7GCTTCAGCCGCCTCACTGTTCTCAGACTCAGCGTAAACCAGAT 
CACTGACGGTGGGGTAAAGGTC-CTAAGCGAAGAGCTGACCAAATACAAAATTGTGACCTA 
TTTGGGTTTATACAACAACCAGATCACCGATGTCGGAGCCAGGTACGTCACCAAAATCCT 
GGATGAATGCAAAGGCCTCACGCATC77AAACTGGGAAAAAACAAAATAACAAGTGAAGG 
AGGGAAG7A7C7CGCCC7GGC7G7GAAGAACAGCAAA7CAATC7C7GAGG77GGGA7G7G 
GGGCAA7CLAAG77GGGGA7GAAGGAGCAAAAGCC77CGCAGAGGC7C7GCGGAACCACCC 
CAGC7TGACCACCC7GAG7C77GCGTCCAACGGCA7C7CCACAGAAGGAGGAAAGAGCC7 
7GCGAGGGCCCTGCAGCAGAACACGTC7CTAGAAATACTGTGGCTGACCCAAAATGAAC7 
CAACGA7GAAG7GGCAGAGAG777GGCAGAAA7GT7GAAAG7CAACCAGACG77AAAGCA 
777A7GGC77A7CCAGAA7CASA7CACAGC7WARGGGACTGCCCAGC7GGCAGATGCG77 
ACAGAGCAACACTGGCA7AACAGAGA777GCC7AAATGGAAACCTGATAAAACCAGAGGA 
GGCCAAAG7C7A7GAAGA7GAGAAGCGGA77A7C7G777C7GAGAGGA7GC777CC7G77 
CA7GGGG7777TGCCC7GGAGCC7CAGCAGCAAATGCCAC7Y7GGGCAG7C7777G7G7C 
AG7G7C77AAAGGGGCC7GCGCAGGCGGGAC7ATCAGGAGTCCAC7GGC7CCA7GA7GCA 
AGCCAGC77CCTGTGCAGAAGG7C7GG7CGGCAAACTCCCTAAGTACCCGC7ACAA77CT 
GCAGAAAAAGAATG7G7CT7GCGAGC7G7TG7AGT7ACAG7AAATACAC7G7GAAGAGAC 
777A77GCCTAT7ATAA77A77777A7C7GAAGC7AGAGGAA7AAAGC7G7GAGCAAACA 
GAGGAGGCCAGCCTCACC7CA77CCAACACCTGCCATAGGGACCAACGGGAGCGAG77GG 
7CACCGCTCTTTTCAT7GAAGAGTTGAGGA7G7GGCACAAAG7TGGTGCCAAGCTTCTTG 
AA7AAAACG7GT77GA7GGA77AG7AT7ATACCTGAAA7A77TTC77CC77C7CAGCACT 
77 C C CA7G7A7TGA7AC7GG7 CC CAC77CACAGC7GGAGACACCGGAG7A7G7GCAG7G7 
GGGA777GAC7CC7CCAAGG7777G7GGAAAG77AA7G7CAAGGAAAGGA7GCACCACGG 
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GCTTTTAATTTTAATCCTGGAGTCTCACTGTCTGCTGGCAAAGATAGAGAATGCCCTCAG 
CTCTTAGCTGGTCTAAGAATGACGATGCCTTCAAAATGCTGCTTCCACTCAGGGCTTCTC 
C7CTGCTAGGCTACCCTCCTCTAGAAGGCTGAGTACCATGGGCTACAGTGTCTGGCCTTG 
GGAAGAAGTGATTCTG7CCC7CCAAAGAAATAGGGCATGGCTTGCCCCTGTGGCCCTGGC 
ATCCAAATGGCTGCTTTTGTCTCCCTTACCTCGTGAAGAGGGGAAGTCTCTTCCTGCCTC 
CCAAGCAGCTGAAGGGTGACTAAACGGGCGCCAAGACTCAGGGGATCGGCTGGGAACTGG 
GCCAGCAGAGCATGTTGGACACCCGCCACCATGGTGGGCTTGTGGTGGCTGCTCCATGAG 
GGTGGGGGTGATACTACTAGATCACTTGTCCTCTTGCCAGCTCATTTGTTAATAAAATAC 
TGAAAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGG (SEQ ID NO : 38 ) 
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VNTDPVSRYTQQLRHHLGPXSKF^/LCYAQKEELLLEEIYMDTIMELVGFSNESLGSLNSL 
ACLLDHTTGI LNEQAAS RKVTGCVCRTC S S STTATQSGTP RRCLP S CCAS PTWP S SPSMA. 
WTSCTRTWT ( SEQ ID NO : 39 ) 
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CACGCGTCCGCGCTACTGCGGGAGCAGCGTCCTCCCGGGCCACGGCGCTTCCCGGCCCCG 
GCG7C Z CCGGACCATGGCGC7CTCC3GGCT C77CTCTAGC7CTCAGCGGC7GCGAAG7C7 
G7AAACC7GG7GGCCAAG7GA77G7AAG7CAGGAGAC777CC7TCGG777C7GCC777GA 
TGGCAAGAGG7GGAGA77G7GGCGGCGA77ACAGAAAACA7C7GGGAAGACAAG77GC7G 
77777A7GGGAATCGCAGGG77GGAAGAGACAGAAGCAA77CCAGAAATAAA7TGGAAA7 
7GAAGA7TTAAACAA7G77G7777AAAA7A77C7AAC77CAAAGAA7GA7GCCAGAAAC7 
TAAAAAGGGGC7GCGCAGAG7AGCAGGGGCCC7GGAGGGCGCGGCCTGAA7CC7GA7TGC 
CC77C7GC7GAGAGGACACACGCAGC7GAAGA7GAA777GGGAAAAG7AGCCGC77GC7A 
C777AAC7A7GGAAGAGCAGGGCCACAG7GAGA7GGAAA7AA7CCCA7CAGAG7C7CACC 
CCCACA7TCAAT7AC7GAAAAGCAA7CGGGAAC7TC7GG7CACTCACA7CCGCAA7ACTC 
AG7G7C7GG7GGACAAC77GC7GAAGAA7GAC7AC77C7CGGCCGAAGA7GCGGAGA77G 
7G7G7GCC7GCCCCACCCAGCC7GACAAGG7CCGCAAAA77C7GGACC7GG7ACAGAGCA 
AGGGCGAGGAGG7G7CCGAG77C77CC7C7AC77GC7CCAGCAACTCGCAGA7GCC7ACG 
TGGACC7CAGGCC77GGC7GC7C-GAGA7CGGC77C7CCCC77CCC7GC7CAC7CAGAGCA 
AAG7CG7GG7CAACAC7GACCCAGG7AGGAG7CAGCCCCAGCAAGACCGCAGGCACCAGT 
GCAAGCAGGGCCC7GGGGGG777GG7AA7GGC7GGGCCAGCCC7GAG7GCCACC7CAGGA 
AGCAGGCCCAGGTGC7A7777GA777TAGAAAGGAACAGC7GAATCC7G7C7CCCAAG7G 
CAGCCCAGG7GGC7GCGA77GAAC7GCCCACACC7CGA7GG7C7GG777A7AGAGGGGCC 
777GGAAG7A7GGGAA7GGCC7G7G77C7GACCCC7TGC77TCTTCC7A77C7GACA7AT 
G7AGACA77T7AATGG77GCACAAA77CAAGG77G7A77T7T7777C777AAAAAAA7C7 
T7AGC7GGACA7GG7AGCACACACC7G7AG77CCAGC7AC7CAGGAGGC7GAGGCAAGAG 
3AC7GC7TGAGCCCCAGAG7C7AAGGC7GCAGCGAGC7A7GA77GTGCCCC7ACAC7CCA 
CAG C C7GGG77T7AGAG7GAGAC C C7G7C7C7AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAANGGGCGG (SEQ ID NO:4q) 
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CCACGCGTCCGCGGACCGCGAGCGGTAGCGCCCTCCC7CCCAGCTGTTGTCCCGCCCGAT 
CCGCGACCCTAGTCCCCGGATCCCC7TGCTGAGAGTCACCGTACTCCAGGGCCAACTGAG 
CCAAAGTCCTGCCAACTTGGGTCAGCAATGAAAGGCAGGATCCTGGGTGGTGGCCCTGAA 
TCCTGATTTGTCTGCCCTGCCL\GCGAGACACATGTGGTCAAAGATGAATTTGAGAAAAGT 
AGCTGCTGGCTACTTGAACAATGGAGGAACACGGCCATCATGAGATGGAAGGCACCCCAT 
TGGGTTGTCACTC CCACATTAAACTG C7GAAGATCAACAGGGAACATCTGGTCACCAACA 
TTCGGAACACTCAGTGTCTGG7GGACAACTTGGTGGAGAATGGCTACTTCTCAGCCGAAG 
ATG CAGAGATTGTG7GTGC C7G7CCCAC CAAGCC7GACAAGGTCCGAAAGATCCTTGACC 
7GG7GCAGAGCAAAGGCC-AGGAGG7GTCTGAGTTC77CC7CTACGTGC7GCAGCAGCTGG 
AGGATGCTTACGTGGACC7CAGGC7G7GGCTCTCAGAAATTGGCTTCTCCCCTTCCCAGC 
7CA7TCGGACCZAAAACTA7CG7CAA7AC7GACCCAGTAAGCAGGTATACCCAACAGC7GC 
GACACCAACTGGGCCGCGAC7GGAAG77CATGC7GTGCTACGCCCAGAAGGAGGACCTGC 
TGCTGGAGGAGACC7A7ATGGACACAC7CA7GGGGCTGGTAGGCTTCAACAATGAAAACC 
7GGGCAGCCTAGGAGGCC7GGAT7GCC7GCTGGACCACAGTACGGGCGTCCTCAACGAGC 
ATGGCGAGACTGTC7TCG7G7TCGGGGACGCGGGAGTGGGCAAGTCCATGCTGCTGCAGA 
GG77GCAGAGCC7C7GGGCGTCAGGCAGG77GACC7CCACAGCCAAATTCT7C77CCACT 
7CCGC7GCCGCA7GTTCAGCTGC77CAAGGAGAGCGACATGCTGAGTCTGCAGGACCTGC 
7CTTCAAGCATTTC7GC7ACCCGGAGCAGGACCCCGAGGAGGTGTTCTCCTTCTTGCTGC 
GCTT7CCCCACACAGCGC7C77CACT7T7GACGGCCTGGA7GAGCTGCAC7CAGAC77CG 
ACCTGAGCCGCGTGCCGGA7AGC7GC7GCCCCTGGGAGCCGGCTCACCCTCTGGTCC7GC 
TGGCTAACCTCC7AAG7GGGAGGC7GCTCAAGGG7GCCGGCAAATTGCTCAC7GC7CGCA 
7AGGCG7GGAGG7CCCCCGCCAGC7CC7GCGCAAAAAGG7GCTGCTCCGGGGC7TC7CCC 
CAAG7CACCTGCGCGCC7A7GGCCGCCGGA7G77CCCCGAGCGCACAGCGCAGGAGCATC 
7GCTGCAGCAGC7GGATGCCAACCCCAACCTCTGCAGCCTGTGCGGGGTGCCGCTCTTCT 
G7TGGATCATCTTCCG7TGT7TCCAGCACTTCCAGACGGTCTTCGAGGGC7CCTC77CAC 
AG77GCCGGACTGTGC7GTGACCC7GACCGA7G7CTTTCTGCTGG7CACTGAGG7GCATC 
7GAACAGGCCGCAGCCCAGCAGCC7GG7GCAGCGCAACACGCGCAGCCCGGCGGAAACCC 
7ACGTGCAGGCTGGCGCACGC7GCATGCGC7GGGAGAGGTGGCTCACCGAGGCACCGACA 
AGAGCC7CTTTG7GT77GGCCAGGAGGAGG7GCAGGCG7CGAAGC7GCAGGAAGGAGATC 
7GCAGCTGGGCTTCCTGCGGGC777GCCCGATGTGGGCCCTGAGCAGGGCCAGTC7TACG 
AA7TT7TCCACC7TACGC7GCAGGCC7TC77CACCGCC7TCTTCCTGG7AGCAGATGACA 
AAG7GAGCACCCGGGAG77GC7GAGG7TCTTTCGAGAATGGACGTCTCC7GGAGAGGCAA 
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CAAGC7CG7CC7GCCA77C77CC77CT7C7CC77CCAG7GCC7GGGCGGCAGAAGCCGG7 
TGGGCCC7GA7CC777CAGGAACAAAGA7CAC77CCAG77CACCAACC7C77CG7G7GCG 
GGC7AC7GGCCAAAGCCCGACAGAAAC7CC77CGGCAGC7GG7GCCCAAGGC7ATCC7GA 
GGAGGAAGCGCAAGGCCC7G7GGGC7CACC7G777GC7AGCC7GCGC7CC7AC77GAAGA 
GCCTACCTCGGGTCCAG7CTGGAGGCTTTAACCAGGTGCATGCCATGCCCACATTCCTGT 
GGA7GCTGCGC7GCA7C7A7GAGACGCAGAGCCAGAAGG7GGGGCGCC7CGCCGCCAGGG 
GCATCAGTGCGGACTACCTCAAGCTGGCCTTTTGCAACGCTTGCTCTGCGGACTGCAGCG 
CCCTGTCCTTCGTCCTGCATCACTTCCACAGGCAGCTGGCCCTAGACC7GGACAACAACA 
ACCTCAATGACTATGGGGTGCAGGAGCTGCAGCCTTGCTTTAGCCGTCTCACGGTTATCA 
GACTCAGCG7CAACCAGA7CACCGACACGGGGGTGAAGG7GCTATGTGAGGAACTGACCA 
AGTATAAGATCGTGACC-TTCCTGGGTTTATACAACAACCAGATAACTGATATCGGAGCCA 
GGTATGTGGCCGAAATCCTGGATGAATGCAGAGGCCTCAAGCACCTTAAACTAGGGAAAA 
ACAGAATAACAAG7GAGGGCGGGAAG7G7G7GGC777GGCTGTGAAGAACAGCACCTCCA 
TCGTTGATGTTGGGATGTGGGGTAATCAGATTGGAGACGAAGGGGCAAAGGCCTTCGCAG 
AGGCATTGAAGGACCACCCCAGCCTGACCACTCTCAGTCTTGCATTCAATGGCATCTCTC 
CGGAGGGAGGGAAGAGCCTTGCGCAGGCCCTGAAGCAGAACACCACACTGACAGTAATCT 
GGCTGACCAAAAATGAACTTAATGATGAGTCTGCAGAGTGCTTCGCTGAGATGCTGAGAG 
TGAACCAGACGCTACGGCATTTATGGCTGATCCAGAATCGCATCACAGCCAAGGGGACAG 
CGCAGCTGGCGAGGGCACTGCAGAAGAACACAGCCATAACAGAGATTTGTCTCAATGGAA 
ACTTGATTAAGCGGGAGGAGGCCAAAGTCTTCGAGAATGAGAAGAGAATCATCTGCTTCT 
GACGGACGCTCCTGGGCAGGATCTTTGTCCTAGGTTGCTCCTCAGTCACAGACAGCACTG 
TGCAG7CAGCAGGG7AGCAGGA7GC7G7GCAGCGCC7GCAGCAAGG7GCC7G7CAGGAGC 
CCACAGC7CCACAGTGCACACCGA7G7CCCC7GC7CA7GC77GGAC7GGTAGCACCCGCG 
CCGCGGCTGAGACCC7GCAGACGCAGGGAG7CTTAGGAACCA7CGTCACCACTCAAAGCC 
AGCAGGGCA7C77C7G7ACAAAGA7C7CCC7GCATA7CCACTAGACGGAAGC7GAAGGAA 
CGCAACAGCAGAGGAGGCCAACAGACGCC7GGC7GAAGGC7CCG7GGGACCAACGG7G7C 
ACC7TCAGAAAAGAGC7GGGAACT7GAGCAGAGCCGATGG7AAC7TC77GGGGAAAGAAG 
GCACCCAGTGACTGCA7GG77A7TC7GAG7CCTCCTTCCTC7GCTTAG7CCCTCTCACTG 
7ACAGG7CTG7TTCTTCCTCGCAGC7GTGGC7GCTGAAGTAGGTCCACTGTGGGGAGAGC 
7CATCACAGACTTTGG77CGGTTCTGGATTCTCAGTGGTGGCAACCGAGAG7CAGACGAT 
ACCC7C7AGG7CAG7CTCAGAGGA7C7C7A7GCTG7GAGAGGG77GAGGGCCCACCCAGA 
ATTT777TT7TTTACCAGTT777AC7G7GCC7GCCCCAGGAGGGAGAA7TACTTCCCAGC 
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CTCCACAGCAGCAGGCATGGC7TGCC7CAATGGTCCTGAGATCCCAACAAAACTCTC7CC 
CTTGCCTGTGAGCAGAAAGTATCTTCATGTCCTCAGAAGTTGGAGGGTGACTGGACACAG 
TTAAGACTCAGAGAGCCAGCTGATAGCT'CAAAGCAAAGCATGGCACATACCCACCACCAT 
ACCATGGTGCGCATGGGA7GGGACAGTTGGAATGTTGCAGATAACGTGTTCTTTTGCCAG 
TTCATTTGTTAATAAAATATTTAAAACGTTAAAAAAAAAAAAAAAAAAAAAAAAAGGGCG 
G (SEQ ID NO: 42 ) 
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>1EEHGKKEMEGT?LGCHSKIKLLK:INREHLVTNIRKTQCLVDNLLENGY?3A2DAEIVCA 
C?TKPDKVRKIIJDLVQSKGEE^/SEF?LY^7LQQLEDAYVDLRLWLSEIGFS?SQLIRTKTI 
•/NTDPVSRYTQQLRHQLGRDSKFMLCYAQKEDLLLEETYI^TI^GLVGFNNENLGSLGGL 
3CLLDHSTGVLNEHGETVFVFGDAGVGKSMLLQRLQSLWASGRLTSTAKFFFHFRCRMFS 
CFKESDMLSLQDLLFKHFCYPEQDPEEVFSFLLRFPHTALFTFDGLDELHSDFDLSRVPD 
3CCPWEPAHPLVLIJ^L5GRLLKGAGKLL7ARTGVEVPRQLLRKKVLLRGFSPSHLRAY 
.\RRMFPERTAQEHLLQQLDANPNLCSLCGVPLFCWIIFRCFQHFQTVFEGSSSQLFDCAV 
TLTDVFLLYTEVHLNRPQ P S S LVQRNTRS P AETLRAGWRTLHALGEVAHRGTDKSLFVFG 
QEEVQASKLQEGDLQLGFLRALPDVGFEQGQSYEFFHLTLQAFFTAFFLVADDKVSTREL 
LRFFP.EWTSPGEATSSSCKSSFFSFQCLGGRSRLGPDPFRNKDHFQFTNLFVCGLLAKAR 
QKLLRQLVP KAI LRRKRKALWAHLFAS LRS YLKSLPRVQSGGFNQVHAMPTFLWMLRC I Y 
ZTQSQKVGRIJiARGISADYLKIAFCNACSADCSALSFVLK^ 

QELQPCFSRLTVIRLSVNQITDTGVIC/LCEELTKYKrVTFLGLYNNQITDIGARYVAQIL 
DECRGLKHLKLGKNRITSEGGKCVAIAVKNSTSIVDVG^IWGNQIGDEGAKAFAEALKDHP 
SLTTLSIJ^FNGISPEGGKSIAQALKQMTTLTVIWLTKNELlOTESAECFAEMLRVNQTlaRH 
LWLIQmiTAKGTAQIARALQKNTAITEICLNGNLIKPEEAKVFENEKRIICF 
(SEQ ID NO: 43) 
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gatcatcgttcactgcagccttgaactcttgtgctcatgtgatcctcctgccttagcctccccaa 

tagctgggactacaggtgcgccaccatgcctggctaattttttttatttttgtagagatgggtgt 

ctcactatgttgcacaggttggtctcaaactactggccttacttcaagctatctacccatctcag 

cctcccaaagcgctgggattacagtcatgagccaacttgcctggccagataaaggtcttaagcat 

ggttccttcctgctctaggtagagaaaccccacaaccagtgggaggtggggtgagctctttctgt 

agcttttgctttgctgatgaegtcattgatctcttcaggggctgcgcagagtagcaggggccctg 

gagggcgcggcctgaatcctgattgcccttctgctgagaggacacacgcagctgaagatgaattt 

gggaaaagtagccgcttgctactttaactatggaagagcagggccacagtgagatggaaataatc 

ccatcagagtctcacccccacattcaattactgaaaagcaatcgggaacttctggtcactcacat 

ccgcaatactcagtgtctggtggacaacttgctgaagaatgactacttctcggccgaagatgcgg 

agattgtgtgtgcctgccccacccagcctgacaaggtgccccggggacagggacgggcatggcat 

tgtgtggaccccgggagctagaagaggcctctccctgctgatctgagtgaagagcgtgggagttt 

agtccagcgggcagggctgcattttggggtactaatagcacacaaatgcctgggttagcaggttg 

cacagtcaggtattttacttctgtgtttgtgtctggagcaaaccctgacatctcagttctcattg 

ctgtgtgtattggttcccagacacttcatttttagatcccctttaaattaggagggaaaaagaac 

ataagcataagagcatccccagcagcgatgttcattcagtgcctctgaaggctggagggctgctt 

gttgctgggtgagactcggaggggaaccgactcagggtcaggaatgatgacatcccacggtgggt 

ccacagtgaagaatcttccccgctccactgtgggacgccttaacagcccttacttccacttacgc 

tttgcgttatctcctgaaaaataaaatggagaccacaaattccttcttggttagaggaatgacac 

aactcatttatgacatgaccccgctgggactcagaagagaccaggacggtttctgggggaagcag 

tagcacactcgtgtgctttgttctcttctcttgatttgttttcccacatttttaacaagaaaaaa 

agccgtttttaatatatggcctatcgccctcctactgtgtggcccaggtgcctacctcattatgc 

ccaaggggtggttctcacctctccactctcattcctgcacagcagttgtgtcaggttaagaggga 

caaggagaaggctgggcaccgtggctcacgcctgtaatcccagcactttgggaggccgaggcagg 

cagatcacctaaggtcaggagtttgagaccagcctggccaacatggggaaaacccgtctctaata 

aaaacacaaaaattagtcgggcatggtggtgggtgcctgtaatcccagccacttgggaggctgag 

gaaagagaattccttgaacctgggaggtggaggttgcagtgagccaagattgtgccattgcactc 

cagcccfcccagcctgggtgacagagcaagactctgtctcaaaaaagaaaaaaaaaaaaaagaggt 

agagaagtccatggctatttgtctgtcctttttatttttaggctcatggaagcctcctggtttct 

tagagctgagtggttttatttcttgctcaggaggtcatttcacagattttcgggctccaatatgt 

tgactgtcacagcagctggggggatggcatagctaccggctgtactaagaactcagagccctgcc 

ctgagcctgcctgagggtccttatggtaggaggatgcccctcatgccagcccgtgccctcatgct 

tgtgtcacctccaggtccgcaaaattctggacctggtacagagcaagggcgaggaggtgtccgag 

ttcttcctctacttgctccagcaactcgcagatgcctacgtggacctcaggccttggctgctgga 

gatcggcttctccccttccctgctcactcagagcaaagtcgtggtcaacactgacccaggtagga 

gtcagccccagcaagaccgcaggcaccagtgcaagcagggccctggggggtttggtaatggctgg 

gccagccctgagtgccacctcaggaagcaggcccaggtgctattttgattttagaaaggaacagc 

tgaatcctgtctcccaagtgcagcccaggtggctgcgattgaactgcccacacctcgatggtctg 

gtttatagaggggcctttggaagtatgggaatggcctgtgttctgaccccttgctttcttcctat 

tctgacatatgtagacattttaatggttgcacaaattcaaggttgtatttttttttcttttaaaa 

aaatctttagctggacatggtagcacacacctgtagttccagctactcaggaggctgaggcaaga 

ggactgcttgagccccagagtctaaggctgcagcgagctatgattgtgcccctacactccagcct 

gggtgacagagtgagaccctgtctctaaaaaaggaaagaaaaaaattaaaaagccttgccaggtt 

tgattctaggcaaagtattctgtcaccgttgagtgccagtccttatttccaaactaatggaagac 

cccatcagttaactgattagttcaataagtattttttgctgtatccaccacatgccaagacccta 

cactgtgctggatgtcagggagacagtggtgagcagacacagacagggttcctgccctcagggag 

cttcaagtcagctggaagagaccaccagtcagcaatctcaaaaatgtgtcaggacagcggcagtc 

caaggcatgtgagaacatatcattagggccaggatctgctctggggcaggagtcttctttccctg 

cttttgaactctccactttgagacagctgttggtaacataccagcaccaaggacctaagtcctgc 

cttttaaagaatccaatatgttgttggaaacagaagcacaagacaggtgtgtgcttaggggaaac 

aaggccagccggcagagtgtcagtgctaggctccagcttccacagcccctgcaggtgcctgccag 

ccactgctagcttctgactctgtctgctccttcctgtctccccttgtttccttcccccatgaaaa 

aaaaagaaagtattcccatgaggaatcattcttccgaaagacttctctgttggttccgttagcca 

gctactttactagcttttacagcgtaattcactctacaagcagtctcacacaaaagactacatat 
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tgtatgattctgtttatatgaaatgtccagaaaaggtaaatctatagacaaagcaaatcagtagt 

tgcctacggcccagggattggctacaaataggctccagaaaactctgggaagatggtagagatgt 

tctagacctggactgtggtgaggtttgcacaactttgtaaacttactaaaaattactgacaaata 

tataacactccccaacactttgggaggccgaggtgggcagatcgcttgaacccaggaatttgaga 

ccagcctgggcaacatggcgagaccccgtctctacaaaaaaacacaaaaattagttgggcttggt 

ggcatatgcctgtgtcccagctacttgggaggctgaggtgggaggattgcttgagcctgggagtt 

tgagactgcatgattgggtcactgcaccctagcctgagtgacagagcaggaccctatctctaaca 

acaaaaaagcagtgttggtggaggagggccagcgcggccatctggcctggccctcgagtgcgagg 

ggcttcagtgtttagctgcagttcagtgatgacactgtgcggaggaataagggtggcctgtctca 

gacactgatcccagctgaagtttgtcaccttctttctggcaaatctgaggtcaagcagagagatc 

aaagcctggggccctcagggtcaggaatgctggctctgtgacgctccccaggtcctgcatctgag 

gagtggctgcgctggcctcagggcccaggttgtgaattttgtttatgcactcgcctctcctcttt 

gagacctccctgtttgatgctgtttctgcctctctcctcaccctgctgctgtgccctgccacccc 

ctccctccagtgagcaggtatacccagcagctgcgacaccatctgggccgtgactccaagttcgt 

gctgtgctatgcccagaaggaggagctgctgctggaggagatctacatggacaccatcatggagc 

tggctggcttcagcaatgagagcctgggcagcctgaacagcctggcctgcctcctggaccacacc 

accggcatcctcaatgagcagggtgagaccatcttcatcctgggtgatgctggggtgggcaagtc 

cacgccgctacagcggctgcagagcccccgggccacgggccggctagacgcaggggtcaaattct 

tcttccactttcgctgccgcatgttcagctgcttcaaggaaagtgacaggctgtgtctgcaggac 

ctgcccttcaagcactactgctacccagagcgggaccccgaggaggtgtttgccttcctgctgcg 

cttcccccacgcggccctcttcaccttcgatggcctggacgagctgcactcggacttggacctga 

gccgcgtgcctgacagctcctgcccctgggagcctgcccaccccctggtcttgctggccaacctg 

ctcagtgggaagctgctcaagggggctagcaagctgctcacagcccgcacaggcatcgaggtccc 

gcgccagttcctgcggaagaaggtgcttctccggggcttctcccccagccacctgcgcgcctatg 

ccaggaggatgttccccgagcgggccctgcaggaccgcctgctgagccagctggaggccaacccc 

aacctctgcagcctgtgctctgtgcccctcttctgctggatcatcttccggtgcttccagcactt 

ccgtgctgcctttgaaggctcaccacagctgcccgactgcacgatgaccctgacagatgtcttcc 

tcctggtcactgaggtccatctgaacaggatgcagcccagcagcctggtgcagcggaacacacgc 

agcccagtggagaccctccacgccggccgggacactctgtgctcgctggggcaggtggcccaccg 

gggcatggagaagagcctctttgtcttcacccaggaggaggtgcaggcctccgggctgcaggaga 

gagacatgcagctgggcttcctgcgggctttgccggagctgggccccgggggtgaccagcagtcc 

tatgagtttttccacctcaccccccaggccttctttacagccttcttcctcgtgctggacgacag 

ggtgggcactcaggagctgctcaggttcttccaggagtggatgccccctgcgggggcagcgacca 

cgccctgctatcctcccttcctcccgttccagtgcctgcagggcagtggtccggcgcgggaagac 

ctcttcaagaacaaggatcacttccagttcaccaacctcttcctgtgcgggctgttgtccaaagc 

caaacagaaactccttcggcatctggtgcccgcggcagccctgaggagaaagcgcaaggccctgt 

gggcacacctgttttccagcctgcggggctacctgaagagcctgccccgcgttcaggtcgaaagc 

ttcaaccaggtgcaggccatgcccacgctcatctggatgctgcgctgcatctacgagacacagag 

ccagaaggtggggcagctggcggccaggggcatctgcgccaactacctcaagctgacctactgca 

acgcctgctcggccgactgcagcgccctctccttcgtcctgcatcacctccccaagcggctggcc 

ctagacctagacaacaacaatctcaacgactacggcgtgcgggagctgcagccctgcttcagccg 

cctcactgttctcaggtgaggctgccaggcaaggggagcaacaggtgggccgggcgggccaggct 

cggagggcatcgggaatggcatcatggaccaggatcccccaggactcatgaccatggcccttgga 

atgtccagaccttttctttcttagcagggcagaggtcaaggtgcaaagcttcgaggcaggtggac 

ctggatcagccacagctgggtgcccttgaacaaagtgcttaactcccagagcctccacgccctca 

tctggaaaaagaagatgctcataatcctaccaattatggccacagggaccaatgttagttgagaa 

tgggtgaagtgcattacaaatattacctaatggaatgctctttacaaccctgtaacttaggtact 

gttattgtctctattttggcagataaggaagtagaggcacagagaagttaatagcttgctttagg 

tcacacagctcagacatagcagcgccagaa-gcataaagaaccttccttttaagattaatgtaag 

gctccgagacagccctcaaaaagtttct.ggaatatgggagcttrcattactgcagagaaagcaga 

ccttgtgccagctggcactggtgactttctgtgatcaacgccagcagcccttcacactgctagag 

acctcagttaaaatgctgactcgtggttgttttcccgttccatagcttacgggaaacagagccca 

gtctgttttcttctattagcatttcctacgtaaaataaaccccgtaaatctctacagagaa'rtaa 

atttgccattacttgactcacgcatccctaaaaagcagtagggatttggaactgactcccagtgc 
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ctgtcacaccagtgtcagagtgtaaataattgcatggggacatggggtgcagggggtcgaaggct 

gccctagcctgggaattggaaaacctggagtctgttctctgtactctcagccagtgactctccct 

ctgtagccccaggcagtctcacactcagtgccaccctctgtccatcttttttttttctcccccaa 

atggagtcccgctctgttgcccaggctggagtgcagtggcgtgatctcagctcactgcaacctcc 

gcctcctgggttcaagcgattctcttgccccagcctcctgagtagctgggattacaggcacacgc 

caccatgtccggctaagttttttgtatttttagtaggacggggtttccccatgttggccaggctg 

gtcttgaaatcctgacctcaggtgatccgcccgcctcggccttccaaaatgttggggttacaggc 

atgagccgccgcacccgacccctctgtccatcttttcaatgggaaactccacaccagtgtggtgg 

ccctgcccttcctgctgtccccaggtgaagctttccttcacaccagtgcaagaaaaaacagcttg 

taggaaagcagaggatatgggtaaccacgggaagcacactcagttctctggctgcatcagttagg 

attagttttagctgagagcgaaaaccccaaatgttggtgagttacaagcttatttctctcatgta 

aaagtctagaggtaggtagttcaggactggtatggagtctccatgaccctccggagcccaggctc 

tcttctgccttcctgttctgccatcctcactacccggctttcccatcttggcccaagagggctgc 

tcaaactccagccatctagtcgacactctagctatcagtaagaaggaagggcaaagattgagagc 

atgcctcaatcttttaagaacacttcttggctattactaattatattgctgcttagatttcagaa 

cttaatggtatgggcagaatttaatgagatgggcccagctaaaagatgggggaatctattgctaa 

gaaagtatagatattgggaatgtctagcagcctgtgctgtcttgggctggccatgccatgtacat 

acacactatttcccagcaccaagctggggactctgagggaaagggtccagagtgtctgacttgat 

cat tttgatgtggcctaaaaatcaagcttttaattgttcagccttt tact tgt tat caaggtcag 

cttgtgggtctaattgggcccaaggcttgtgtttctaagtaaagttttattggaacgcagccata 

cccatttatttacttactggctgcttcacactacacagttgagtagctgtgacagagaccacatg 

gcccacagagcctaaaatatttgctgtctgacactttacagaatgacatgagcagtctcctttga 

cagtgggactcacagccttttccagtgacaaatcagggttagcccatgtgtttctggatgggggg 

aagctgttggcattttgggtataacagttcttgtgagacctgtccagcattttgcaggacaccta 

acatcattggccctgcctgcaagatgacagggcactccctcctccagtcacaaccactaaaagca 

gcccctgacatttccaaacccatgccctccaccatacgagaaccaggtacagggtctggctgaca 

cataggtcacacgcaaagggtggatgtcagaggtggctggcctcacacgtcctccctgtgtcctt 

cacggtcgtgtgaggagccaggggctgtgctgcagcctcgctcatgggctggtgcaggatgggtc 

tggcggccccacgttggccaggctttgtaaggggctatttggctgattgctgtggccattctcca 

ggggcgtctatacctgagaaaactccagggcctgaaggcttctggatctttgtaagattaatggt 

ccttcataatgagtgcctgccctgactcgtaatttttttgctgttttatttcagactcagcgtaa 

accagatcactgacggtggggtaaaggtgctaagcgaagagctgaccaaatacaaaattgtgacc 

tatttggggtatgtctttctccagaacactgggccaactacctagtaataatacagagctgcagg 

gaattcacattcccataggtccctggatgatcggcacggatggcccagggctgggaagagcgctg 

gcccaggagttgagagtcctgggttctctttgtggctcggccagtcatgaagtcttgctgagcct 

cagcctcctcacctgtaaaactgggatcccagtataggcaagtaggcttacaactggttattggg 

ggatgcaacgagaatataaggggatatatttaataaatgctagaatcctgtttacatattagtct 

ggactattttgggtccataatccctcatccagagcctttggggcaagacccgaatggggattctg 

agtgcatgctatggcatgacgtggccgcaggggtctaaggcagtgccccattttcaaacactttc 

atatttctcccgcagaatgtatgaaacagtcaaaccaagtgtggtaagaaagactataagtagct 

ccacatcagttgccaaaagaattgtgagaaactttgggcattcagagcctttgaggttttggagt 

ctgagagaagggattgcgggccagccccacacaactggtggctctgcaagctggagcagttgttc 

agtttcttggggcctcagtggccttcgatgttaatgaggacatggacgcaaacgaccccgggcca 

cactcggctccagggctctgtgtggctgtggaaccctggaagcctgagcttagctgcctttcaac 

ttccatctgctgtactattgaattggcattgagcggtgagatggctgaaaggtagacatcgagaa 

gttttaatattcagaatcttttcttctcaagacgctgaatgtaatcttagttgtaaatacccatc 

acctgccagtcaccgagcactcatgcaccagggctttgcgttatgtcctaagatcctcataacca 

ccctgcaaggggactatcatcattacctctgtattacagatggagaaactgaggcacagagaggt 

aacgtgacttgtctcaggccataaagctggggaaagtagtggagctggttttgaacctgagctgt 

gagacctcagagccctaaactctggtgcctgtgtgttcccctttcaacccagactttggaaatca 

gtagacaccatatgcttcaaaaaacaggggctattaaaatgacatcaggagccagaaagtctcat 

ggctgtgctttctcttgaagtttatacaacaaccagatcaccgatgtcggagccaggtacgtcac 

caaaatcctggatgaatgcaaaggcctcacgcatcttaagtaagtggggtaggcaccaggttcct 

tagtatattctcttgatcacccccttctgttgttcaaagattaaatgtcacagtaaagagctttc 
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atcctaaagccttccacttgtcccagggccatgttggtcaagtaaagatacctctgtgtgatctg 
tgaggcttggattctggaagggcctcccgttattggtagggggaaaggttggcattttgatttca 
ttaactactaggccgaagaaaggactaactctcaccctttctggtggtctttttgccccaaggga 
gtttcctgtcgggttgcaaggaagagcttgggcccttgccctgctgtaggtgtgccctgcgcagg 



agaggaccccagggagggctcaggtccctgagcttctgcagagactgtggaccatctcctggaga 




aggagggtgacccacattggagcccctgcatagctggaggctgactgtgtgtgactctctctgca 

gactgggaaaaaacaaaataacaagtgaaggagggaagtatctcgccctggctgtgaagaacagc 

aaatcaatctctgaggttgggtgagtagaaggggatggatgtatgtggtacaacctgctgtgtgt 

gtggggggcgggccttgctgttcttttcatacatcagtacaccagaaggaccactggggctcgct 

gtcggggagagatagtggagagctttcaccatgctgcgaaactgaaaccgtgcccattaagcaat 

aactccccggtccccctcccccctgcctcttgcagccaccctgctacttactctctctatggttt 

tgactactctacctcatgtaagtggaatcatacagtatttgccttttggggatggctgatttcac 

tagcatcatgtcctcaagattcgtccacatggaagcatgggacaggatttccttttttttttttt 

ttttttttttttttgacagagtctcgctctgttgcccaggctggagtgcagtggcatgatctcgg 

ctcactgcaacctctgccttctgggttcaagcgattctctcgcctcagccacacgagtagctggg 

attataggcacccgccaccaatcccagctaatttttgtatttttagtagaggcggggtttcacca 

tgttggccaggctggtctcaaacccctgacctcaaatgatccacccacctcggtctcccaaagtg 

tcaggattataggcgtgagccaccgtgccccgccaggatttccttcttttttaaggctgagtaat 

actccattgcatggctatgccacattttgtttactcattcatccaagaacagacactggcttgct 

tctatgctttggctgttgtgaataatgctgctgtgcacatgggcatacaaatgtctcttcaagga 

ctgccttcaattcttttttttttttttttttttttttagattctttttttttttattatactcta 

agttttagggcacatgtgcacattgtgcaggttagttacatatgtatacatgtgccatgctggtg 

cgctgcacccactaatgtgtcatctagcattaggtatatctcccaatgctatccctcccccctcc 

cccgaccccaccacagtccccagagtgtgatattccccttcctgtgtccatgtgatctcattgtt 

caattcccacctatgagtgagaatatgcggtgtttggttttttgttcttgcgatagtttactgag 

aatgatggtttccaatttcatccatgtccctacaaaggatatgaactcatcattttttatggctg 

catagtattccatggtgtatatgtgccacattttcttaatccagtctatcattgttggacatttg 

ggttggttccaagtctttgctattgtgaatagtgccacaataaacatacgtgtgcatgtgtcttt 

atagcagcatgacttatactcatttgggtatacacccagtaacgggatggctgggtcaaatggta 

cttccagttctagatccctgaggaatcgccacactgacttccacaatggttgaactagtttacag 

tcccaccaacagtgtaaaagtgttcctatttctccgcatcctctccagcacctgctgtttcctga 

ctttttaatgattgccattctaactggtgtgagatgatatctcatagtggttttgatttgcattt 

ctctgatggccagtgatgatgagcatttcttcatgtgttttttggctgcataaatgtcttctttt 

gagaagtgtctgttcatgtccttcgcccactttttgatggggttgtttgtttttttcttgtaaat 

ttgtttgagttcattgtagattctggatattagccctttgtcagatgagtaggttgcgaaaattt 

tctcccatgttgtaggttgcctgttcactctgatggtagcttcttttgctgtgcagaagctcttt 

agtttaattagatcccatttgtcaattttgtcttttgttgccattgcttttggtgttttggacat 

gaagtccttgcccacgcctatgtcctgaatggtaatgcctaggttttcttctagggtttttatgg 

ttttaggtttaacgtttaaatctttaatccatcttgaattgatttttgtataaggtgtaaggaag 

ggatccagtttcagctttctacatatggctagccagttttcccagcaccatttattaaataggga 

atcctttccccattgcttgtttttctcaggtttgtcaaagatcagatagttgtagatatgcggca 

ttatttctgagggctctgttctgttccattgatctatatctctgttttggtaccagtaccatgct 

gttttggttactgtagccttgtagtatagtttgaagtcaggtagtgtgatgcctccagctttgtt 

cttttggcttaggattgacttggcgatgcgggctcttttttggttccatatgaactttaaagtag 

ttttttccaattctgtgaagaaagccattggtagcttgacggggatggcattgaatctgtaaatt 

accttgggcagtatggccattttcacgatattgattcttcctacccatgagcatggaatgttctt 

ccatttgtttgtgtcctcctttatttccttgagcagtggtttgtagttctccttgaagaggtcct 

tcacatcccttgtaagttggattcctaggtattttattctctttgaagcaattgtgaatgggagt 

ccacccatgatttggctctctgtttgtctgttgttggtgtacaagaatgcttgtgatttttgtac 

attgattttgtatcctgagaccttgctgaagttgcttatcagcttaaggagatcttgggctgaga 
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cgatggggttttctagataaacaatcatgtcgtctgcaaacagggacaatttgacttcctctttt 
cctaattgaataccctttatttccttctcctgcctgattgccctggccagaacttccaacactat 
gttgaataggagcggtgagagagggcatccctgtcttgtgccagttttcaaagggaatgcttcca 
gtttttgcccattcagtatgatattggctgtgggtttgtcatagatagctcttattattttgaaa 
tacgtcccatcaatacctaatttattgagagtttttagcatgaagggttgttgaattttgtcaaa 
ggctttttctgcatctattgagataatcatgtggtttttgtctttggctctgtttatatgctgga 
ttacatttattgatttgcgtatattgaaccagccttgcatcccagggatgaagcccacttgatca 
tggtggataagctttttgatgtgctgctggattcggtttgccagtattttattgaggatttttg'c 
atcaatgttcatcaaggatattggtctaaaattctcttttttggttgtgtctctgcccggctttg 
gtatcagaatgatgctggcctcataaaatgagttagggaggattccctctttttctattgattgg 
aatagtttcagaaggaatggtaccagttcctccttgtacctctggtagaattcggctgtgaatcc 
atctggtcctggactctttttggttggtaaactattgattattgccacaatttcagagcctgtta 
ttggtctattcagagattcaacttcttcctggtttagtcttgggagagtgtatgtgtcgaggaat 
gtatccatttcttctagattttctagtttatttgcgtagaggtgtttgtagtattctctgatggt 
agtttgtatttctgtgggatcggtggtgatatcccctttatcattttttattgtgtctatttgat 
tcttctctctttttttctttattagtcttgctagcggtctatcaattttgttgatcctttcgaaa 
aaccagctcctggattcattgattttttgaagggttttttgtgtctctatttccttcagttctgc 
tctgattttagttatttcttgccttctgctagcttttgaatgtgtttgctcttgcttttctagtt 
cttttaattgtgatgttagggtgtcaattttggatctttcctgctttctcttgtaggcatttagt 
gctataaatttccctctacacactgctttgaatgcgtcccagagattctggtatgtggtgtcttt 
gttctcgttggtttcaaagaacatctttatttctgccttcatttcgttatgtacccagtagtcat 
tcaggagcaggttgttcagtttccatgtagttgagcggctttgagtgagattcttaatcctgagt 
tctagtttgattgcactgtggtctgagagatagtttgttataatttctgttcttttacatttgct 
gaggagagctttacttccaactatgtggtcaattttggaataggtgtggtgtggtgctgaaaaaa 
atgtatattctgttgatttggggtggagagttctgtagatgtctattaggtctgcttggtgcaga 
gctgagttcaattcctgggtatccttgttgactttctgtctcattgatctgtctaatgttgacag 
tggggtgttaaagtctcccattattaatgtgtgggagtctaagtctctttgtaggtcactgagga 
cttgctttatgaatctgggtgctcctgtattgggtgcataaatatttaggatagttagctcctct 
tgttgaattgatccctttaccattatgtaatggccttctttgtctcttttgatctttgttggttt 
aaagtctgttttatcagagactaggattgcaacccctgcctttttttgttttccattggcttggt 
agatcttcctccatccttttattttgagcctatgtgtgtctctgcacgtgagatgggtttcctga 
atacagcacactgatgggtcttgactctttatccaacttgccagtctgtgtcttttaattgcaga 
atttagtccatttatatttaaagttaatattgttatgtgtgaatttgatcctgtcattatgatgt 
tagctggcgattttgctcattagttgatgcagtttcttcctagtctcgatggtctttacattttg 
gcatgattttgcagcggctggtaccggttgttcctttccatgtttaccgcttccttcaggagctc 
ttttagggcaggcctggtggtgacaaaatctctcagcatttgcttgtctataaagtattttattt 
ctccttcacttatgaagcttagtttggctggatatgaaattctgggttgaaaattcttttcttta 
agaatgttgaatattggcccccactctcttctggcttgtagggtttctgccgagagatccgctgt 
tagtctgatgggctttcctttgagggtaacccgaactttctctctggctgcccttaacatttttt 
ccttcatttcaactttggtgaatctgacaattatgtgtcttggagttgctcttctcgaggagtat 
ctttgtggcgttctctgtatttcctgaatctgaacgttggcctgccttgctagattggggaagtt 
ctcctggataatatcctgcagagtgttttccaacttggttccattctccacatcactttcaggta 
caccaatcagacgtagatttggtcttttcacatagtcccatatttcttggaggctttgctcattt 
ct t t ttattcttttttctctaaacttcccttctcgct teat tt cat t cat ttcatcttccattgc 
tgataccctttcttccagttgatcgcatcggctcctgaggcttctgcattcttcacgtagttctc 
gagccttggttttcagctccatcagctcctttaagcacttctctgtattggttattctagttata 
cattcttctaaatttttttcaaagttttcaacttctttgcctttggtttgaatgtcctcccgtag 
ctcagagtaatttgatcgtctgaagccttcttctctcagctcgtcaaaatcattctccatccagc 
tttgttctgttgctggtgaggaactgcgttcctttggaggaggagaggcgctctgcgttttagag 
tttccagtttttctgttctgttttttccccatctttgtggttttatctacttttggtctttgatg 
atggtgatgtacagatgggttttcagtgtagatgtcctttctggttgttagttttccttctaaca 
gacaggaccctcagctgcaggtctgttggaataccctgccgtgtgaggtgtcagtgtgcctctgc 
tggggggtgcctcccagttaggctgctcgggggtcaggggtcagggacccacttgaggaggcagt 
ctgcccgttctcagatctccagctgcgtgctgggagaaccactgctctcttcaaagctgtcagac 
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agggacacttaagtctgcagaggttactgctgtctttttgtttgtctgtgccctgcccccagagg 

tggagcccacagaggcaggcaggcctccttgagctgtggtgggctccacccagttcgagct?ccc 

ggctgccttgtttacctaagcaagcctgggctatggcgggcgcccctcccccagcctcgttcrcca 

ccttgcagtttgatctcagactgctgtgctagcaatcagcgagattccgtgggcgtagqaccctc 

tgagccaggtgtgggatatagtctcgtggtgcgccgtttcttaagccggtctgaaaagcgcaata 

ttcgggtgggagtgacccgatcttccaggtgcgtccgtcacccctttctttgactcggaaagqcia 

accccccgatcccttgcgcttcccaggtgaggcaatgcctcgccctgcttcggctcgcgcacqat 

gcgcgcacacactggcctgcgcccactgtctggcgctccctagtgagatgaacccggtacctcag 

atggaaatgcagaaatcacccgtcttctgcgtcgctcacgctgggagctgtagac^ggagctgtt 

cctattcggccatcttggctcctccctccaattcttttgggtatatatccagSagtggglttict 

ggatcaca t ggtaatttttaattttttgaagaatcatcatactgttttcca?gg?aiSgcaIca 

ttttatgttcccaccaacagttcatcctagtttctccacatccttgccaacacttgctattttct 

ctttttgacagtacccatcctaatgagtgtgaggtcctgtctcattgtggttttgattcttgagg 

ctttttaaagctttttgtttcattataatttctattggattacaaaaggiacacaggtaatEtlf 

ttcggaaactatgaaaaacaataaaaattatcttctcagaaaatgattcttgttaacatttaaoc 

tcagttaagctctctcactttctctcccttctctctctttgtacaacttttaaaaaatatagtag 

gggtgagactatatgtatctatactatagcaggggtgagactatatgtatccttcctttttcact 

^^ C ^ C ; t p Ctt:9a9t:agctttccactttat:taaaaat: 9 1:Qat g cc attcaattgtatagtaa 

a ^ a S a tataaQcaaaacact 9 aaaact cttat:tctgggttccagcaagccatacctggaat 

ggcgtaagcaggtagtttgcttggtgtgaacgtgttgttgaggcagctgccattgtgttgtgaqt 

gggccacacgaacttgttctgttgtgtgtagacagtgtgtgctgatcctattaggaacagccaac 

gctttgtgtgagccacacacggttctaagtgctttgcttctgttaactcagtgaatcctcacaac 

tccatgacggaatgctctaattatccccattttatagatggggcaactgaggtccaagagactac 

ataatttcccgaagttcacacaggcagcagatggcagagccgggtcaggagtccaccatcttacc 

acgcagactgttttagccagagactctccggatctgctgtaggggacagaatacagctttatcgc 

cgcacctgtccaccaagatggccgtagccacagagcttggttgggtaacgtcctctttatgtgac 

aggaacgttgctgacggggtttctgaaggtacttcctgctctttgtctcctggaagactgtgtct 

tcaggaacgtctctgaccctgcccagagttgaacggatgctgggaacccagcacctgcacacggc 

cttccctccaggactctgcgcacctctgtgctccacaggagacatgcaggtgctttctctcatla 

gctcaggctcctgggctgacagctctccgaagctcgtggtgaggctcggtctctaactgtgccic 

ttgccgatggcctctgttcacaaggcttcccctgctcttcgatcttgcatcaccccttgaitttg 

aaatccagagcagcccactcagagaccagtgtgaggaattagtgtccaggccacagatccagggl 

ctgggcacaaacatctgcctgttgagtaggaactgagctgtggccattggcaaaaaaggagggit 



caagttggggatgaaggagcaaaagccctcgcagaggctctgcggaaccaccccagcctgacc 




-tgtcaaacctttctttgatgcataagaggccatctagtaaagcacattcttctcttttttta 
actttaagttctgggatacatgtagaagatgtgcaggtttgttacataggcaaatgcatgccatg 
gtgatttgctgcacctatcaacctgtcatctaggttttaagccctgcatgcattaggtatttgtc 
ctaacgcctgccctccccttgccccccacccccaacaggccccggtgtgtgttgttcccctccat 
gtgcccatgtgttctcattgttcaactcccacttacgagtgagaacatgcagtgtttggtttttt 
gttcctgtgttagtttgctgagaatgatggcttccagcttcatccatgtgccagcaaaggacatg 
atcccatttttttttatggttgcacagtattccacagtgtgtatgtgccacattttctttatcca 
gtctatcactgatgggcatttgggttggttccaagtctttgctattgtaaacagtgctacaataa 
acacacatgtgcctgtgtctttatagcagaatgatttataatcctttgggtaaatacccagtaat 
gggattgctgggtcaaatggcatttctggttctagatccctgaggaaccaccttaagtgtttatt 
cagcccagtgaattctgcatgtgtcccacaccagccaaccaccacccccatcaagacagaggaca 
^!:"* g " c ? tca9Ccatcccc ^ 

cgaaacttaacaagatgctggccagcagattcctgccccttccttgtcctcaggatqacqctqqa 
aaagagggactcttcctctctataaarggggatgcacctacccagcccccgcttaggctgccggc 
ZtZlZ* t,: 999acct:tggtatgcccac=7 " -cctgccgctgttcttcctac2actgl2aaagagEc 
S a !f aa9 ?^ 999gaca 9 ta 3 ca g aa gag ; - -^ccgccaggtcttgcagatggggtaccttgatcrgg 
gccagcctttagaa-gacagcv -gccaggcctcgccagcctcctgcccatgtgcagaaaccr-ag 
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gtgccgaccccagcccactgttgtgtgagcaggctgtgctgatgacccatttcccgtccagcctg 

cccttgtgctctgtgtgtgggctctggggcagcagcgcctgggcactactgctgcagctgaacac 

ttctgcatcctgccccgagtgagcctgggctggggccacagccaggcagaggcttcccagctgtt 

ctgatgttgaagctaagattgaatgtagatgtgtctttaataattcaccccaagtgtgttccttc 

ctagtcttgcgtccaacggcatctccacagaaggaggaaagagccttgcgagggccctgcagcag 

aacacgtctctagaaatactgtggtaatagctcgagtcatttcatttgtttgtttgtttttctgt 

gatagggtcttgctttgtcgtccaggcttgagtgcattggtgtgatctcagctcactgcagcctc 

cacctcccaggctcattcgaacctcccgccttggccttccgagtcctgagactataggcatgcac 

caccacacccagttaattttaaaattttttgtagagatggggttttgctatgttacccaggctgg 

tcttgaactcctgggctcaagcagttctcctgccctggcttctcaaagccctgggattgcaggtg 

tgagccactgcacctggcacagagtcattttggagggtttaggtcccaggaattatcccaggggc 

tgcacatggcctggaatcttaacagaaaaggtgtctcccaattggaaaggctctaggcctttcag 

ttaagttgataatttcctcctagagaagagaatagccacttctacaagcataaacaggtacagga 

ggaggaagtgggctccgggagcctggatctgaggccttggccttctaggccccaggagaactaga 

acgctggccatgcaagctatccaggtatccttggataccttcagatgtgcttagcagaggccaac 

ttccacacacttggctcaaaattttctcccttcctcctcttcatctgccttcccccaggcagcct 

cctccttccccaggtcttcacatcagggtttggcctttatgctccatccagctcatctgtcactt 

gtcacctgaagcccacagtcctcgctccctctctgcactctagggcacttactaagtggatgtgg 

cctcctgagagtgttttttgttggtgttcccttttttatggccacctaatgttttattttgcttt 

atttgtatttacatctctgtatcataaattccatacaggtggctgggagcagtgactcacatctg 

taatcccagtactttggaaggctgaggtgggaggatcgcttgaggccaagagttcgagactagcc 

tgggcaatatagcgagaccctctatctacaaaaaaaaaaaacattccttacaggttaagtgaggg 

agttgtattacaaccctccctatcatctactcagagcccagtgctcatttgatcttgctaaatta 

gttactgagaataatgacaatatcctcttcatgagagagttttgacattaggcctgctgtccagt 

aagtgcattttaaattctttcccctcaacaaatcatttaacattttgaaaagtagtttatgtttt 

ttggaaaaaatgtaagacactaaaggaggacatgaaagtacctcctaaagttcctgctaaaagga 

ggaagtgaaagtacctccctttgtgttttccaaaataacctttcctttctagccttttgttctat 

gtatgttcaaagatatgcaaaacagaatagcattcaagcagtggctctaaaaatattgtaatcac 

atactttacatgtctcctttagggtttctccatcttgatgctgttgacattttggtccaagtgat 

tctttattatggtagggctgtcctgtgcatcatagacggtttagccgcatctctgccctgtacct 

cccagtggtgaggatcaaaaatgcctccggacatggccaggtgccccatggagagtgaaatcaca 

tggatagtagtaatgtcaacacctagaagccctcaagtgctgactgcatgccatgtgttattcta 

cactttttccctgtgttaactcactcagcctcacaaccactctatacgatctctactgttaacgt 

tcaccagtgagaaaactcagacccaaagaacttaagcctgttgcccgaggtcaccctgctggtgg 

gtgatacaaacctgcccaggctgagcccggagtagatgtcaatgctgtgttcttctccctcctca 

ttctacctcattctccctacaagctgcacaacatctcgaatagatatcacaatatatttcatcag 

ttgtttctgatctaaatttgttcagattttacattaggataataccacaatgcatgctgcaatgt 

ataaagctttgtgtgtatatccttgcacactgtagggtaaatttctagaagtctgattgtcttaa 

aatgaagcacattaaaaatttgggcaggcacatccaaactgcccttcaaggaattttttttttta 

aatgttctttctgttctattcttcttcctaatgattctttcgtccactggcacaagtgggtccta 

ccctgtttacaccaaggagctttggtgctttatccagaccacttctggttctaaggaccattgag 

agacttcctgaactttcagtcacttaacttgggtccctcacaagttaactgagagcaaagtactg 

aacacattttaatgtgcagtcagtgactgtttcaggtcttcaaactaacttggataacacactgt 

cagtggtgttcaagggaccctgggactagaggagaactgagaagcaggcattggccctttgtttt 

ccgtgggcccccatcttccatgaaatctgagggctcagcaaaggtggggagggagggtgggctcc 

tctacaggtagctgggctaagaaataggagcccaggtacaggatttgcattaaaaatgagtccca 

ttgaccttctgtggggctgacaggctgggcttggagcctggctgttttctgggttctcagcaagt 

gatcatctgcatagctggagagccttgggctgagcccccgctcctgtgaactctaaaacaatgtc 

tgccaagtaggctctcttgagtaaatacttccttttttttccttaggctgacccaaaatgaactc 

aacgatgaagtggcagagagtttggcagaaatgttgaaagtcaaccagacgttaaagcatttatg 

gtaactcagagagccttacaatttcagactgtgctacttttcaaaagtattttttgagataaaat 

ttacatactgtaaaattcactctcttaaagtatacaattcagaggtttttagtgcaaccatcacc 

acctaattctagaacattttcactcctcctccccactccaaaaagccctggtatccattaagcag 

tcactccctgtcctcctccccagaccctggcaaccactaatccgctttctgtctctatggatttg 
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tttctgggtcatatggcaattctgtaactttttgaggagccaclllactgtt??c?afagtgla? 
g ^ a "" acattctcgcca 9C aa t9tat^^^ 

ttattattgtccatcttttaaaatctagttatactagtggatgtgaagtaatattgtgg???? 3 ! 
tttgcatttccctgatgacaacaatgttgaatgtctttttatgtgccEactgggaitllatatfa 
cttctttggagaaatgtctccatatcccttgcccattttaaaitEgggttti^^c^altacta 
ag ^ a 5 agggg " c t ctatatattct 9g^^ 

atttttgttttttttgagatggagtcttgctcttgtcacccagacaggagtgcagtggcatqatc 

tgggattacaggcatcagccaccatgcctgtctcattttgtatttEtaataia g atgggg?t a ?a 
Sa a a^ a l^?? a99Ct9gtCttgaaCtCCtgaCCtcaggtgatcc ^ 

agtgctgggattacaggcgaaaagccactgcacctggccaatagtttttaattttgatqaaatcc 
aatttatctat t tttttctttgg t tgcttgtgctttcagtgtcItatctaagaaa?ga? a Scta 
a ^^ aagatcacaaagaactccacctaa g ttt tctgttaagcgttatagttgtttcccctcaca a 
ataggtctgcaatccattttgagttaatttttgtatagtgtaaagtgaiggttaacctcattcJc 

atggccttgacacccttgtcaaaaatcaattgaccataaatgtatgggtttatttctgaattctc 

cattttgaaatcaggaggtgtgagttctactttgttcttctttctcaagattgtttaiaLaftc 
tgggttctttgcatttcttatgaattcagactcaccttgtcaatttcticaaLagiSagactc 

aaao a ^ a ^ 9ttttttCtttCCtttttagCCtgCagaattatt ^ 

aggccagcctctccagggagagcagagctaggacagggtcagaaagagagtcttggctgctttat 

fj: a "^f aacctgca " ggcccta 9W 

agg ^"" taggggcattaggt 9Ctctccttc^ 

9 CCtt = aaaaaagcctaa 3 t 9 QtQacta ^aaacagcagagtgtaaStga2taiagIa 
?^S?^ 9CCCaCttCCtggttCtatttttgtCccttttgaaa 999 aa ^cattaIct^ 
tgaacccaggggccctagcccttgtggggtatggctgggagcaccagatcctggctgcagcccaa 
ccaccagtggtcctgtgtgcttgggcagtaacagtgacaagagctcccttccc?ctggacactg? 
g ?^ aa = aCCC J CCtCttgaaatCtCacacacccagtg 9 at g9999g cact cttatagttattc^ 
cagtttacagatgacacaactgaggcacagacagatgcgtttatttcttcaaggttctcitaQcta 
aacagtggggagggagggtttaagaggagctgcacccgctctgcaatactgcctctcaEgaggga 
gtcctcttcattcatgacagcatagggccctcgtcttcctggtaagggcttccttcttgggEcaa 
tgccaggatttctaagggtcatgtttagcaggagcctattctacaaacagccaggagcagggaal 

gactctgcgatgaagcggagacactacagcctcttgatgcatttatttcctggEEgggttagaag 
^ ag ;: tgc ^ caa 99gagcatttcaggagaggcctggcttcc^^ 

ccatttgaatcactgctacccagaacaatggggtgcattctcagagtccccattattaaagcttt 

tccactgagccccatgagaactattcatgagaactatttcatggcagcataactgtttctcctcc 

ctccctcttgcatgttggtagcctcttaactttaaaacctgccttgcctttccctagctacctgg 

aaggagacgtrcagacttcctgtcccatggtgtgtttcttacaatttgttgttcagattggtcicrtc 

tcccaaatatatataaaaatataaatggagtctcactctgtcacccaggctggagtgcagtggca 
cfatcttggctcactgcaacctccacctcccagttcaagS^^ 

S ag " gggagtacaggt 9 cacacca ccatgcccagctaatttttttgtatttttaataga g aaag 
gttggccaggatggtatcgacctcctgagctcgtgatccacccacctcggcctcccaaagtgctg 
ggactacaggtgtgagccaccgcacccggccccaaatattttgattatgcacctctgcaqtgaaa 
aa *I? aaa ? aCaca f atcagttca ^ 

aaaaaatatcaagc 5 atcctttactcta 9 t 99 a tcttacctggacacttttagccagatacaaag 
tcacatggacncagtccttcccctgaccaacttgtctcttatcccaaaacaccctticaactccl 
ttacgaaggggtcaaacttgatccagtattatggattttatacaagttatgttcttctttcaggc 
ttatccagaatcagatcacagctaaggggactgcccagctggcagatgcgttacagagcaacact 
ggcataacagagatttggtaagatcccagcgtttgccacag^aaEaalalcagtglcfg??? 3 " 
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caccaccactgactgtgcaaggcacaacgcagggtggtttctgtttattcctccagcaaccctgc 
acagtaatggtattacctctgttttacagaggtagacagaggcccagaccagtgaaataaggttg 
cccaaggtcactacgagagaagctagaattcagcccagaatgcctgattccatattctgtgctct 
cctgccctgggcccccgccctcatctaccttcattgggtgggatgggggaagtggccagtgaaat 
gatttcctagtggaagtaaatccccctgggactcagcaattgagagatgactgtgttggccagga 
gtttggagctcattcttccccttttctgggttccgtaagacatttccaggctgacttgaactgac 
ctgtgctctttgtctacttcttttttctgctttgagaacttccttatgctaatagaagaaaaaaa 
gtttgctttactgtgacattgagcgccatgccacttctttcttgcctcccataaggcacagacac 
tccccactcagcagctcccttaacaacttaattgcctgggtgacgtgggactgggtggatgctgg 
gagaggggccttattaactatgtcctcctttcatgactggggagaatttcatagccaattaaaaa 
aaaacaaaaaacagctccttggccaacacaggctcctcatacagtgttttttaaactttgcttta 
gaacttgtttggaacttgtcataaaatcgatcagtttggtgaattgcaaccaacaatatttaaaa 
agaaaacagaacagaacaaaatatcaggatgcaatgtgcatggtatgaaagtatcatttcattca 
tcttagttcatgcttgcatgtgagtgggtgtgtgtttgcataagtgttggttcacaacataaaat 
gtaattcttatttagggttgtagacaaaaggtttttttttaaaaaaaacactgttggctaggcat 
ggtggctcatgcctgtaataccagcactttgggaggccaagatgggcagatctcttgagcacagg 
agtttgagaccagcctgggcaacatgcgaaaccccgtcactacaaaaattagcccgacatggtgc 
tatgtgcctgtagtcccagctactcaggaagctgatgtgggaggatggatgcatgggagatcaag 
gctgcagtgagccaggatcatgccactgcaatccagcctgggtgccagagaccctgtctcaaaaa 
acaaaaaagaaaaaaagaaaaacaccatcatagagaatagagcccagatctaaacagacacctgt 
ggcctgtgtgcctgcgaagcccagcctgcccagcagcctgggaagcactggagggcactggaact 
gtttgcatgggtgtttgccctcaggccactccgtttctgctgattcttaagttttgaggacagca 
ggcagagggggagaggaaggagactgccagactacagaacagtttgcagagcacagttggcttcc 
acttttctctgtagctggtcaggcgggtagtaaagacctacagttgctttaattctgtcaagttt 
caaaatctgcattgcttccctcttgagggtcaccattcctacacaaggaaccattttagtagggc 
caggagacttcagcttcaaggcctgcacttgtgtcagggtggagaggggaactggccaccaattc 
agagagggcaggacaggcggcatgggtgctggtcttgggagtgtcttcacttaggtccctggctt 
gttctgggagcctccagagcatgctcctctgtgtgtgacttcatgggactgggctctgagaaggc 
tgtggctttgttggccctgccagggactgccacaccaggccacagggttgtggttgagctggccg 
gggagccacgttcagggagcagctctgcttggagccaacacttacagagtaagccttctccttgg 
acttgttaactgtactgacacttatttctacctcattcctttctgaaaataacttggaagtctga 
agtcccttgatgagttctgtctttaagaacagaaattagaggtgaacaatgaacactgtaaatta 
cagaaatgtatcccactccagtataacagctttctgtgaggctatctcctccagactgtggctct 
gggagggtggggcctgagtcaaggtcctagggactagtgctgtgtcttcatttattccttgaata 
acgaaacgcttgagcatcagggactgtgctagcaccaaaaatccagtggtgaacaacatggcttc 
atgggttcactgtctagaaagggagaagcacattaaagaaaaaatcatttgcgtaattatttaat 
tacaactgtgatgggtactatcacaaaggggaaggccaagagggaacctgatttagatgaggttg 
cagggaaggcctctctgaggaagcagcacttacactaagccatgaaggatgaataggagctagtc 
agctgaggtgagtattctgcgtagggaacagcatgtgcaaagggtctggggcaggagggagtgtg 
gtgtcctggaagaactgccagaagctgctgtgccccagggttcagacagtgtggaagaggggact 
acaggaggctgaggagataggcagggactggaccataaaagatctgtgggtcatgatgtgcattt 
tggtctttatcctaaaagtgatggaaagtcagtgaacagtttgaagcaggagaggcatgtgatca 
gatctgcaatgcaaaaagaccaattcttggctcttctaggaaactgaattggagaaggccagagt 
acgtggaaatgacctgtcagtaggacattgtactgatgcagggaagagatgatgggtgctcagac 
caagatggccggccaaagacacagaggttccagggaggcattctagattcttaggaattagggga 
gaactttgtgatacaaggaacatggggatgagaaggaaggtgtccaggttgacccccaggttact 
aacctgctcagcaggatgagagtggtccattcactaagccaggggaccctaggaggtgtggctac 
tttgaggtgtgggggagaggtccaagtgaggatgccaagcaggtaactgcctccacggacataca 
aacaaggccgtggcattgatgagatcgggtggggaaaagggcttagccccaaacctggaggaaat 
ctcagatgtagaggtcacatggaggagaatataggaaaggaaattgaagtagagtgctcagatgc 
aggagaaaaatcagcgcatataaccaagccaaggggagggagtgcctcaagaaggagggagagga 
gaggtcaggacagccaaaatcctgagggccaagaaagacaagacctggaaaatgtcattaaattc 
aggcttatggaggctacaggtgaccttagtgagacccagtgaacagagggatggcagctggagag 
gatccatgctaatatgaaggaactatctgcaaagggtatgtcccttaatttcagggatacatgtg 
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tattgtgtgatacacgagtgtgtgctatgaacacaccttgggaaggagtgtgcgaggatccttaa 
cattttacctgtgtacttttgtcttcctccttttcaacagcctaaatggaaacctgataaaacca 
gaggaggccaaagtctatgaagatgagaagcggattatctgtttctgagaggatgctttcctgtt 
cacggggtttttgccctggagcctcagcagcaaatgccactctgggcagtcttttgtgtcagtgt 
ctcaaaggggcctgcgcaggcgggactatcaggagtccactgcctccatgatgcaagccagcttc 
ctgtgcagaaggtctggtcggcaaactccctaagtacccgctacaattctgcagaaaaagaatgt 
gtcttgcgagctgttgtagttacagtaaatacactgtgaagagactttattgcctattataa 



FIG. 18 {10 Of 10) 



BNSDOCID: <WO 0100826A2_L> 



WO 01/00826 



PCTAJS00/17691 



1 GTCGACCCACGCGTCCGGCAGCAGGCAGGCTGCAGCAGGCGAGCAGCAGCAAGAGTAAAAGG 
CAGCTGGGTGCGCAGGCCGTCGTCCGTCCGACGTCGTCCGCTXXJTCGTC 



63 tgaccgcggctgcccaccccagagccatcgggcgggc^ 

actggcgccgacgggtggggictcggtaccccgcccgtgctctacggtaggacc^xx:gagaa 

l^MGRARDAI LDAL 



125 GAJ^.CTTGTCAGGGGATGAACTCAAAAAGTTCAAGATGAAa 
CTITTX^CAGTCCCCTACTTGAGTTTTTCAAGTTCT 
13> E N L SGOEL KKFKMKL LTVQL R 



187 AGAAGGCTATCGGCGCATCCCACGCGGGGCCCTGCTGCAGATGGACGCCATAGATCTC^ 
TCTTCCGATACCCGCGTAGGGTGCGCCCCGGGACGACGTCTACCTGCGGTATCTAGAG^ 
33> EGYGRI PRGALLQMDA I DLT 



249 ACAAACTTGTCAGCTACTATCTGGAGTCGTATGGCTIX3GAGCTC^ 

TGTTTGAACAGTCGATGATAGACCTCAGCATACCGAACCTCGAGTGTT^ 
54>DKLVSYYL ESYGL ELTMTVL R 



311 GACATGGGCTTACAGGAGCTGGCTGAGC1AGCTGCAAACGACT 
CTGTACCCGAATGTCCTCGACCGACTCGTCGACGTITG^ 
75> D M G L Q E L A EQLQTTKEESGA V 



373 GGCAGCTGCAGCCAGTGTCCCIXSCTCAGAGTACAGCCAGAACAGGAC^ 
CCGTCGACGTCGGTCACAGGGACGAGTCTCATGTCGGTC^^ 
9S> AAAASVPAQSTARTGHFVDQ 



435 ACAGGCAAGCACTCATTGCCAGGGTC^CAGAAGTGGACGGAGTGCTGGATG 

TGTCCGTTCGTGAGTAACGGTCCCAGTCTCTTCACCTGCCTCACGAC 
116»H RQAL I ARVTEVDGVLDALHG 



497 AGTCTGCTGACTGAAGGACAGTACCAGGCAGTTCGTGCAGAGACCACCA 

TCACACGACTGACTTCCTGTCATCGTCCGTCAAGC^ 
137^ SVLT EGQYQAVRA ETTSQDKM 



559 GAGGAAGCTCTTCAGCTTrGTTCCATCCTXXlAACCTG^ 

CTCCTTCGAGAAGTCGAAACAAGGTAGGACCTTGGACTGGACGTTCCTGAGGGAGG^ 
157> RKLFSFVPSWNLTCKDSLLQ 



621 CCTTGAAGGAAATACATCCCTACTTGGTGATGGACCTGGAGCAGAGCTG^ 

GGAACTTCCTITATGTAGGGATGAACCACTACCTGGACCTCGTCTCGACTC 
178>A LKEI HPYLVMDLEQS 



683 AGCTACATTATCTAGCTCCTGACTTIGTATACACAATTITTGAAA 
TO^TGTAATAGATCGAGGACTGAAACATATGTGTTAA 



745 GTTTAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 
CAAA riTlTlTmTlTlTlTi ' r CCCGCCGGCG 
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1 CGCGTCCGGCTGCA.GCGGGGTGAGCGGCGGC\GCGGCCGGGGATCCTGGAGCCATGGGGC 
GCGC^GGCCGACGTCGCCCCACTCGCCGCCGTCGCCGGCCCCTAGGACCTCGGTACCCCG 

I> M G 

61 GCGCGCGCGACGCCATCCTGGATGCGCTGGAGAACCTGACCGCCGAGGAGCTCAAGAAGT 
CGCGCGCGCTGCGGTAGGACCTACGCGACCTCTTGGACTGGCGGCTCCTC 
3>R A R D A I LDALENLTA EELKK 



121 TCAAGCTGAAGCTGCTGTCGGTGCCGCTGCGCGAGGGCTACGGGCGCATCCCGCGGGGCG 
AGTTCGACTTCGACGACAGCC^CGGCGACGCGCTCCCGATGCCCGCGTAGGGCGCCCCGC 
23> F K L K L L SVPL REGYGR I PRG 



181 CGCTGCTGTCC^TGGACGCCTTGGACCTCACCGACAAGCTG^ 

GCGACGACAGGTACCTGCGGAACCTGGAGTGGCTGTTCGACCAGTCGAAGATGGACCTCT 
43»A LLSMDALDLTDKLVSFYL E 



241 CCTACGGCGCCGAGCTCACCGCTAACGTGCTGCGCGAC^TGGGCCTGCAGGAGATGGCCG 
GGATGCCGCGGCTCGAGTGGCGATTGCACGACGCGCTGTACCCGGACGTCCTCTACCGGC 
63>T Y GA EL TA NVL RDMGL QEMA 



301 GGCAGCTGCAGGCGGCCACGC^CCAGGGCTCTGGAGCCGCGCCAGCTGGGATCCAGGCCC 
CCGTCGACGTCCGCCGGTGCGTGGTCCCGAGACCTCGGCGCGGTCGACCCTAGGTCCGGG 
83>G Q L QA A T HQGS G A - A P A G I Q A 



361 CTCCTCAGTCGGCAGCCAAGCCAGGCCTGCACTTTATAGACCAGCACCGGGCTGCGCTTA 
GAGGAGTCAGCCGTCGGTTCGGTCCGGACGTGAAATATCTGGTCGTGGCCCGACGCGAAT 
103»P PQSAAKPGLHF I DQHRAAL 



421 TCGCGAGGGTCACAAACGTTGAGTGGCTGCTGGATGCTCTGTACGGGAAGGTCCTG^ 

AGCGCTCCCAGTGTTTGCAACTCACCGACGACCTACGAGACATGCCCTTCCAGGACTGCC 
123> I A RVTNVEWt L DA LYGKVL T 



481 ATGAGCAGTACCAGGCAGTGCGGGCCGAGCCCACCAACCCAAGCAAGATGCGGAAGCTCT 

TACTCGTCATGGTCCGTCACGCCCGGCTCGGGTGGTTGGGTTCGTTCTACG^ 
143>D EQYQA VRA EPTNPSKMRKL 



541 TCAGTTTCACACCAGCCTGGAACTGGACCTGCAAGGACTTGCTCCTCC^ 

AGTCAAA^GTGTGGTCGGACCTTGACCTGGACGTTCCTGAACGAGG^ 
163»F S FT PAWNWTCKDL L L QA L R 



601 AGTCCCAGTCCTACCTGGTGGAGGACCTGGAGCGGAGCTGAGGCTCCTTCCCAGCAACAC 

TCAGGGTCAGGATGGACCACCTCCTGGACCTCGCCTCGACTCCGAGGAAGGGTCG 
183>E SQSYLVEDLERS 



661 TCCGGTCAGCCCCTGGCAATCCCACCAAATCATCCTGAATCTGATCrriU'rATACACAAT 
AGGCCAGTCGGGGACCGTTAGGGTGGTTTAGTAGGACTTAGACTAGAAAAATATGTGTTA 



721 ATACGAAAAGCC^GCTTGAA 
TATGCTTTTCGGTCGAACTT 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hCARD5-DNA 740 aa vs. 

> mCARD5-DNA 7 63 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

68.2% identity; Global alignment score: 2377 

10 20 30 
inputs C GCGTCCGGCTGCAG-CGGGGTG AGCG-GCGGCAGC GGC 



CCACGCGTCCGGCAGCAGGCAGGCTGCAGCAGGCGAGCAGCAGCAAGAGTAAAAGGTGAC 
10 20 30 40 50 60 

40 50 60 70 80 90 

input S CGGGGAT CCTGGAGCCATGGGGCGCGCGCGCGACGCCATCCTGGATGCGCTGGA 



CGCGGCTGCCCACCCCAGAGCCATGGGGCGGGCACGAGATGCCATCCTGGACGCTCTTGA 
70 80 90 100 110 120 

100 110 120 130 140 150 

i npu t s G AACCTGACCGCCGAGGAGCTCAAGAAGTTCAAGCTGAAGCTGCTGTCGGTGCCGCTGCG 



AAACTTGTCAGGGGATGAACTCAAAAAGTTCAAGATGAAGCTGCTGACAGTGCAACTGCG 
130 140 150 160 170 180 

160 170 180 190 200 210 

input s CGAGGGCTACGGGCGCATCCCGCGGGGCGCGCTGCTGTCCATGGACGCCTTGGACCTCAC 



AGAAGGCTATGGGCGCATCCCACGCGGGGCCCTGCTGCAGATGGACGCCATAGATCTCAC 
190 200 210 220 230 240 

220 230 240 250 260 270 

inputs CGACAAGCTGGTCAGCTTCTACCTGGAGACCTACGGCGCCGAGCTCACCGCTAAC-GTGC 



TGACAAACTTGTCAGCTACTATCTGGAGTCGTATGGCTTGGAGCTCAC-AATGACTGTGC 
250 260 270 280 290 

280 290 300 310 320 330 

inputs TGCGCGACATGGGCCTGCAGGAGATGGCCGGGCAGCTGCAGGCGGCCACGCACCAGGGCT 



TTAGAGACATGGGCTTACAGGAGCTGGCTGAGCAGCTGCAAACGACTAAAGA--AGAG-T 
300 310 320 330 340 350 

340 350 360 370 380 390 

input s CTGGAGCCGCGCCAGCTGGGATCCAGGCCCCTCCTCAGTCGGCAGCCAAGCCAGGCCTGC 



CTGGAGCTGTGGCAGCTGCAGCCAGTGTCCCTGCTCAGAGTACAGCCAGAACAGG AC 

360 370 380 390 400 410 

400 410 420 430 440 450 
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input s ACTTTATAGACCAGCACCGGGCTGCGCTTATCGCGAGGGTCACAAACGTTGAGTGGCTGC 

ACTTTGTGGACCAGCACAGGCAAGCACTCATTGCCAGGGTCACAGAAGTGGACGGAGTGC 
420 430 440 450 460 470 

460 470 480 490 500 510 

i npu t s TGGATGCTCTGTACGGG AAGGTCCTGACGGATGAGCAGTACCAGGCAGTGCGGGCCGAGC 



TGGATGCTTTGCATGGCAGTGTGCTGACTGAAGGACAGTACCAGGCAGTTCGTGCAGAGA 
480 490 500 510 520 530 

520 530 540 550 560 570 

inputs CCACCAACCCAAGCAAGATGCGGAAGCTCTTCAGTTTCACACCAGCCTGGAACTGGACCT 



CCACCAGCCAAGACAAGATGAGGAAGCTCTTCAGCTTTGTTCCATCCTGGAACCTGACCT 
540 550 560 570 580 590 

580 590 600 610 620 630 

inputs GCAAGGACTTGCTCCTCCAGGCCCTAAGGGAGTCCCAGTCCTACCTGGTGGAGGACCTGG 

GCAAGGACTCCCTCCTCCAGGCCTTGAAGGAAATACATCCCTACTTGGTGATGGACCTGG 
600 610 620 630 640 650 

640 650 660 670 680 

inputs AGCGGAGCTGAGGC-TCCTTCCCAGCAACACTCCGGTC-AGCCCCTGGCAAT-CCCAC-C 

AGCAGAGCTGAGGTATCTTTTCCAGCTACATT ATCTAGCTCCTGACTTTGTATACAC 

660 670 680 690 700 710 

690 700 710 720 730 740 

inputs AAATCATCCTGAATCTGATCTTTTTATACACAATATACGAAAAGCCAGCTTGAA 



AATTTTTGAAAAAACAATT _ tGTATTTGTGTTTAAAAAAAAAAAAAAAAAAAGG 
720 730 740 750 760 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hCARD5 -protein 195 aa vs. 

> mCARD5 -protein 193 aa 
scoring matrix: paml20.mat, gap penalties: -12/ -4 

71.8% identity; Global alignment score: 712 

10 20 30 40 50 60 

inputs MGRARDAILDALENLTAEELKKFKLKLLSVPLREGYGRI PRG ALLSMDALDLTDKLVS FY 



MGRARDAILDALENLSGDELKKFKMKLLTVQLREGYGRIPRGALLQMDAIDLTDKLVSYY 
10 20 30 40 50 60 

70 80 90 100 110 120 

inputs LETYGAELTANVLRDMGLQEMAGQLQAATHQGSGAAPAGIQAPPQSAAKPGLHFIDQHRA 



LESYGLELTMTVLRDMGLQELAEQLQT-TKEESGAVAAAASVPAQSTARTG-HFVDQHRQ 
70 80 90 100 110 

130 140 150 160 170 180 

inputs ALIARVTNVEWLLDALYGKVLTDEQYQAVRAEPTNPSKMRKLFSFTPAWNWTCKDLLLQA 



ALIARVTEVDGVLDALHGSVLTEGQYQAVRAETTSQDKMRKLFSFVPSWNLTCKDSLLQA 
120 130 140 150 160 170 



190 

inputs LRESQSYLVEDLERS 



LKE IHPYLVMDLEQS 
180 190 
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1 CCCGCGTCCCGACTTCCCTI>(XAGTGTITC 

GGGCGCAGGCCISAAG«^GGTCACAAACAAGGAGAGA^ 



66 GCATGTTTTATCTITGCTAAGTAGGATTTCTGTCT^^ 

CGTACAAAATAGAAACGATTCATCCTAAAGACAGAAAGAAAC^ATTGTC^ 



131 CAGAATGACCTGATCCATTTCCTGCITTGTAGAAAGC^ 
GTCTTACTGGACTAGGTAAAGGACCAAACATCTTTCG^ 

1>MASEGASSE 



196 ATX2ATAGAAAAACAGCGAACAAAGTTGCTCAGTGTCCTCC 

TAGTATCTTTTTGTCGCTIXnTTCAACGAGTCACAGGAGGT^ 
10> I I EKQRTKLLSVLQQDPDS I LD 



261 CACGTTAACCTCTCGGAGACrGATTTCTCAGGAGGAGTATGAGACTCT 
GTGCAATTGGAGAGCCTCTGACTAAAGACTCCTCCTCATACTC^ 
21* TLTSRRL I SEEEYETL EA 1 TD 



326 CTCTGAAGAAAAGCCGGAAGCTGTTAATTITGATCCAGA 
GAGACITCTTTTCGGCCTTCGACAATTAAAACTAGGT^ 
53>P L KKSRKLL 1 L I OKKGEDSCCC 



391 TTCCTCAAGTCTCTXHCTAATGCCTTTC^ 

AAGGAGTTCACAGACAGATTACGGAAAGGTGTCAGTCGAAGGT^ 
75* FLKCLSNAFPQSASTLGLKQEV 



456 TCCACGGCAGGGGACTXXLAGAGGTTGTCGAGGTGAGCAGGGGTTTGG 
AGGTGCCGTCCCCTGACCTCTCCMCAGCTCCACTCXr^ 
96* PRQGTGEVVEVSRGLEDPFSL 



521 GGACCATAACCCCAGAAATAGCAGAGCTCTCAGAAGAGAAAGAATGCCCGGGTCTGGGAGCTCCG 

CCTGGTATTGGGGTCTITATCGTCTCGAGAGTCTTCT^ 
118* G Tl TPEIAELSEEKECPGLGAP 



586 GAGTTCTTCACCTGCAAGGAAAGCAGCCACAGGGAACCGGAAGTACCTTCTO 

CTCAAGAAGTGGACGTTCCTTTCGTCGGTGTrc^ 
140* EFFTCKESSHREPEVPSWENOE 



651 AGGGCGTGGTGCACAGCAAGTCACCGCTCCGCGTTC^GTOiAAGGAGTn^ 

TCCCGCACCACGTGTCGTTCAGTGGCGAGGCGCAAGTCAGTITCCTCA^ 
IQK . <?. fL _ A . P. _Q . Y. - T - - A - . ?. - R - -? . Y. A _G _ V_ JE_ _Y _ E. _V _P _ . 

FlCi. 2.S Ciop7) 
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716 CAAGTATCTCCCTCTTAAGCGACGGGCAGA^ 

GTTCATAGAGGGAGAATTCGCTGCCCGTCTCTATGC^ 
183>A SI SLLSDGQRYEEPDDSLYLE 



781 GAAGGGGAAGGTGAAGAGTCTCTTGGGTACCCTGAAGATGTTTTGGAGGA^ 

CTTO:CCTTCCACTTCTX»GAGAACC^^ 
205» EGEGEESLGYP EOVL EEGA GDD 



846 CCCACAGTCCITICTATATGATAGTX^GG^ 

GGGTGTCACGAAACATATACTATCACTCCTCCTTACGCI^^ 
226* PQCFVY DSEEECEYE ENMGSS 



911 GTGAAGACAGTAGCTGCGACGACACTTCAGAGACCTGCGTTCCA^ 

CACTTCTGTCATCGACGCTGCTGTGAAGTCTCTGGACG^ 
248>G EDSSCDDTSETCVPLEGEKSA 



976 GAAGAAAGAAAAAGAGTGTTTCAACACGTCCTGTCCrGTTTGAACA 

CTICITrCTmTCTCACAAAGTrGTGCAG^ 
210* EERKRVFQHVLSCLNMDRNRKL 



1041 TCTCCCAGAGTIX^TGAGGCAGTTITCCATAGACCGAGGATGTGAGTG^ 
AGAGGGTCTCAAGCACTCCGTCAAAAGGTATCTGGCTCCT^ 
291> IPEFVRQFSl DRGCEWTPKTP 



1106 GAGAC^AGCTTGGAA UTIO ' I GATCAAAGTTCAGCCTTTAGACTC 
CTCIX^AATCGAACCTrAAAGAACTACTTTCAAGTCCGAA^ 
313>G DLAWNFLMKVQALDSTAROSl 



1171 CTGAGGCCCGAGGTGGCGOTIGAAGAGAATGAAGAATT^ 
GACTCCGGGCTCCACCGCCCACTTCTCTTACTra 
335> LRPEVA GEENEELPAG1 EKLGl 



1236 TCGAGACCCCCAAACCATCCATCCCCTCGATGTCCTCT 

ACCTCTGGGGGTTTGGTAGGTAGGGGACCTACAGGAGACGCGGACGTACGAAACAC 
356> GDPQTI HPLDVLCACMLCAOS 



1301 CCTTGCAGCGTGAAGTCATGTCAAACATGTACCAATGC 

GGAACGTCGCACTTCAGTACAGTTTGTACATGGTTACGGTCAAACGAGAAGGGG^ 
378>S LQREVMSNMYQCQFALPLLLP 



1366 GATGCTGAGAACAACAAAAACCTCTTAATGGTAGGGGC 
CTACGACTCTTGTTGTITITGGAGAATTACCATCCCC^ 
400> DAENNKNLLMVGAMKOLKQPSA 
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1431 ACACTCCTCAGGAGGGCCCCTCAGGGAAACA 

TGTCAGGAGTCCTCCCGGGGAGTCCC1T rGTCTGTGTAAAGACCCAGAGTGTTTCTACGGACAGT 
421> QSSGGPLRETDTFLGLTKMPV 



1496 TxrrCTrTTGTGCGACTAGGACGCTGCAGCTTC^ 

AGAGAAAACACGCTGATCCTGCGACGTCGAAGAC^TTCAGGTCT^ 
443> I SFVRLGRCSFSKSRI VNTULS 



1561 TCCTCCCAGCAGAAACCATACCCGATTTTCCTCCATCAGG^ 

AGGAGGG1CGTCTTTGGTATGGGCTAAAAGGAGGTAGTCCTAGACAG 
465> SSQQKPYPI FLHODLSVPVLPR 



1626 GC\AATTTCTGACGGCCTGGTGGAAGTGACATGGTCCTTTC CTGACAAGTTGCTGAAGGAAAGCC 
CGTTTAAAGACTGCCGGACCACCTTCACTGTACCACGAAAGGACTGTTCA^ 
486> Ql SDGLVEVTWCFPDKLLKES 



1691 CGCATGCTTTCCAGAAACCTGTTGCTGTCGCCAACCTTC 

GCGTACGAAAGGTCITTGGACAACGACACCGGTIGGAAGCACCTCTAA^ 
508>P HA FQKPVAVANLRGDLESFWI 



1756 CAATTTGGTTTCCTGGTAGAAGTTTCCTCCGGTCTIT^ 

GTTAAACCAAAGGACCATCITCAAAGGAGGCCAGAAAAGAAAAAGTGTCTG^ 
S20> QFGFLVEVSSGLFFFTDCLGEK 



1821 GGAATGGGACTTGCTAATGTrTITAGGAGAGGACACC^TTGAACGGTGCT 
CCTTACCCTGAACGATTACAAAAATCCTCTCCTGTGGTAACTTG^ 
SSI* EWDLLMFLGEDT I ERCYFI LS 



1886 CCCAGGCTAAGGAGAGTGAAGAAGCCCAGATTTTCCAAAGGATCCTAA 
GGGTCCGATTCCTCTCAC1TCTITO3GTCTAAAAGGTTTC 
573>P QAKESEEAOI FORI LKLKPSQ 



1951 CTACTGTITIGGGAAGCT^GGAAGCTGGGGATAGAAGGAAGACTATGGAGGC 
GATGACAAAACCCTTCGACTCCTTCGACCCCTATCTKXTICI^ 
595> L L FWEA E EA GDRRKTMEA L QAA 



2016 CCTCCAGGAAGTAATGTCCTCTCCACTCAGATGTGTG^ 

GGAGGTCCTTCATTACAGGAGAGGTGAGTCTACACACAGGGAACTTCT^ 
616> LQEVMSSPLRCVSL EEMA SLA 



2081 GGGAGCrGGGCATTCAGGTAGACCSAGACTTOGAAGTrACTCAA 

CCCTCGACCCGTAAGTCCATCTGGTTXTTGAAACTICAATGAGTTCTATAA 
638» RELGI QVDODFEVTQDl QVSPT 
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2146 AC^GTTCAAGGTGAAAACCAACAACCATGTAGTtt^ 
TGTC\ACTTCCACTITTGGTTGTI>GGTACAT^ 
660> TVEGENQQPCSQTKSPA ESGAQ 



2211 GGAGCCAATCAGAGAGCCAGGGGCIX^ 

CCTCGGTTAGTCTCTCGGTCCCCGAGTTACACTGC^ 
681> EPI REPGAQCDDSQNAPVFHQ 



227 6 CTCOAGTATACATGCCTTATCCAGCACACCCATGGGCriTGGCCATC^ 

GAGGTCATATGTACGGAATAGGTCGTGTGGGTACCCGAAACCGGTAGTTTCGACCTCCATTGAAA 
703>T PVYMPYPA HPWALA I KAGGNF 



2341 TACCACGTTCCTTIX2AATGCCCCCTGGTTATG« 

ATGGTGCAAGGAAACTTACGGGGGACCAATACCCGAGGGTGA^ACCTAGTGTCGTCTCCCGATTC 
725> YHVPLNA PWLWA PTL DHSRGL S 



2406 TGGTTCTrrCCATTCCCATGCTAAACCCACTCACTCTAAGGCC^ 
ACCAAGAAAGGTAAGGGTACGATTTGGGTGAGTGAGATTCCGGA^ 
746> GSFHSHA KPTHSKAFQANCHH 



2471 CCCATCCCTCCCATGCTAAACCCACTCATGTGA^ 

GGGTAGGGAGGGTACGATTTGGGTGAGTACACTTAGGGAGAGTACGATTGGGGTGAGTAC^CGTC 
768»P HPSHA K PTHVNPSHA NPTHVQ 



2536 CCTIGC^TGCTAAACCCACTCACTCTAAGGCCTTCC^ 

GGAACGTACGATTIX^TGAGTGAGATTCCGGAAGGTTCGATTTGGGTG^ 
790> PCMLNPLTLRPSKLNPLPLRPL 



2601 TCGAGCCAAGCTAACTGCAATCATGCCCATCCCTCCCTTX^^ 

ACCTCGGTTCGATTGACGITAGTACGGGTAGGGAGGGAACGATTTGGGAG^ 
3X1* GAKLTA I MPI PPLLNPL I Rl P 



2666 TGATCCTAACCCCACTCATtn^SCAGCCTTCCCATC 

ACTACGATIGGGGTGAGTACACGTCGGAAGGGTACGATITGGGCGAGTAGATGT^ 
833>L MLTPLMCSLPMLNPL I YSLPK 



2731 CAAAACCCTCCCCATCCCAATCTACTGCAGTTCACGGCIACACAAACCTC^ 
GTTTTGGGAGGGGTAGGGTTAGATGACGTCAAGTC^CGTG^ 
855> QNPPHPNLLQFTAHKPQQSQSK 



2796 GCCTICTCAGCAGAGACCCAGTCAGCCTAAATCATTCCAGACCAAGCCTTC^ 

CGGAAGAGTCGTCTCTGGGTCAGTCGGATTTAGTAAGGTCTGGTTCGGAAGTGTC 
876> PSQQRPSQPKSFQTKPSQA R A 
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2861 GCCACCCAAGAGCAGGGAGACX7ITAAAGAA<^TA(^^ 
CGGTGGGTTCTCGTCCCTCTGCAArrTCTI^ 
898>C H P R A G R R 



2926 TTCXTrTAACTATrcTTTTTCATATAGCAAGCT 

AACGAATTCATAAGAAAAAGTATATCGTTCGACTTCTTTTCAAAATCACT^ 



2991 AGCAAAACCCAAAAAAGGTATGCAAAGTCTTAAGTGCATAGCAAAGTATCCA^ 
TTCnTTTGGGTTTTTTCCATACGTTTCAGAATTC^^ 



3056 TGGAAGCAGTTAAAAGTAGAATCTGGCTGGGCATGGTGGCAC^^ 
ACCTTCGTCAATTTTCATCTTAGACCGACCCGTACCAC^^ 



3121 AGGGCTCTGTCATCC CAACTCAGAGAAGCAGGCAGATCTCTG J K^ 

TCCOSAGACAGTAGGGTTGAGTCTCTTCGTCCGTCTAGAGACACACAAACTCCG^ 



3186 ACATAACAACGACACAAGCAAGTCCTACATCAGCCATACTACAAAATGAGACCCCATCTGGGGAC 
TGTATTGTTGCTGTGTTCGTTCAGGATGTAGTCGGTATGATGTm 



3251 AAAAGGGTIGGATCTAACATCAAACCAAAGAAATCAGTCAACTA 
TTTTCCCAACCTAGATTGTAGTTTGGTTTXinTTAGTCAGTrCA 



3316 ACACTCAGTGGGTTACCACIAACCAAACCATACTCGACAACTAACCCCCTAAAGGAGCAAGAAGGA 
TGTGAGTCACCCAATGGTGTTGGTTOXTATX^O 



3381 GTTGGGTGGGTGTTAGGCTGAACATGATOGGGGAAGAACTGA^ 
CAACCCACCCACAATCCGACTTGTACTAACCCCTTC^^ 



3446 ACAGGTTATGGGACTrcTCAAATCCATTAAATGCAATATTAAGAAGCAGTGGGAATC^ 
TGTCCAATACtTCTGAACAGTITAGGTAATrTACGTrATAATKnTCGTCA 



3511 ACATTAAGCTCCAGTGAGTCGCAACCCTCCCCTATTAGATGATGTGAGATTTGAACCCCAG^ 
TGTAATTCGAGGTCACTXAGCGTrcGGAGGGGATAATCTACT 



3576 TGGGGTGTCTCTGATAGCCCGTGTGTGTGACAAACTC 
ACCCCA(^CAGACTATCGGGCACACACACTGT^^ 



3641 AGTTCAGCTTATCTGTX^TGAAGAAAGGCrGCTTCAGAG^ 

TCAAGTCGAATAGACACAACTTCTTTCCGACGAAGTCTCCACGGAACCAAAACCC 
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3706 GCCACTGAGCAGATACTCTGCACCATTGGTACAGTTAAATCAGCTTGC^^ 
CGGTGACTCGTCTATGAGACGTGGTAACCATGTCAAT^ 



3771 ATCTACCACATITATCCCTTACAGGCGGAAATAATG^ 

TAGATGGTGTAAATAGGGAATGTCCGCCTTrATTACTrACT^ 



3836 TAACCTTGCTACTGATTTGTATATGTATCATTC 

ATTGGAACCATGACTAAACATATACATAGTA^GAAATATArTATCGAlTCn'l'l'AAATCGAGTAA 



3901 AGGGGTTCTGATATATTAGTTTAATGGTirrGAAGTO 
TCCCCAAGACTATATAATOUUVTTACCAAACTTCAG^ 



3966 TAATTGAAAATATTCAGATGAATTTACAAAGGCTATA^ 
ATTAACTITTATAACTCTACTTAAATGTITC 



4031 GTAGACTCATACIGTTCTGAACATITGGATAGCTTCTCGTAGTTAGCAG 
CATCTGAGTATGAC^AGACTTXTTAAACCTATCGAAGAGCATCA^ 



4096 ATTTCIATTCAGGTATITAACCAGAGCTGCrCTTAGT'^^ 

TAAACTAAGTCCATAAATK^TCTCGACGAGAATCAAAAATTCACAGTGGTTC^ 



4161 GCTAC^TTATCTGAAGATGTGGGAACACAACTGTGACCTTACA 
CGATGTAATAGACTTGTACACCCTTCTGTTGACACT^ 



4226 ATCAAGGTTCAAGCCAGCAGCAC^TAGTGAGACCAGGTCTCAA^ 
TAGTTCCAAGTTCGGTCGTCGTGTATCACTCTGGTC 



4291 AGGAAGATTTTAAAATTTGCCTCATTAAGAAATAAAGTAAGATITO 
TCCTTCTAAAAl'm'AAACGGAGTAATTCTTTATITCATTCT 



4356 CATCTTrGAACTTATGACTGTTTAATITITIGACTrA 

GTAGAAACITGAATACTGACAAATTAAAAAACTGAATTTCAAATTAAAATAATAAC^ 



4421 GTICTATGTCTGTX^CATGTCTGCCACTGCATC^ 

CAACATACACACACGTGTACACACGGTGACGTACATACACCTCCGGTAGTCTGTT 



4486 TCTGTICITTCCTCTTAGCCCTATC^ 

AGACAAGAAAGGAGAATCGGGATACACAAAATGGGTGACTCGATCCGGTGGATGAGGATA 
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4551 TAATTITAAATAGTAAAATAGTTCTAAGAAGTCAATC^ 

ATTAAAATTTATCATTTTATCAAGATTCTTCAGTTAGTCCC'ri' 1" 1 " I "i'TACCGACAGTTTCAGAGT 



4616 AAGAAAAATCGTATTAGCCATGGATAGAGACTCACCIXrrTGAATCA'riUU 
TTCTTTTTAGCATAATCGGTACCTATCTCTGAGTGGAGAAOT 



4681 TAATATCACAATAATGTGTTTCTACATGTCTTAGTTAATATT&Tl'l'lCA 

ATTATAGTGTTATTACACAAACATGTACACAATC^ATrATAACAAAAGTCTCATAAATTAGAGAG 



4746 ATGATTATPGTAAAGATGAAAAAAGAAATAGTGGGCAATGTATGTGA 

TACTAATAACATTTCTACl'I" L'L'l TCT I'l ATCACCCGTrACATACACTCATAAATTAAAACGGACT 



4811 CAATTCTGTCTOTTAGAATGATAAATGTAAGAAGTAAAATAAAACGGTTCAT^ 
GTTAAGACAGAAAATCTrACTATTITACATnrTTCATTTTATTTTC 



4876 AAGCCAGCTCACTrAAGTCTGGGCCCTGCTGGCATTGGCTAGTCTAGCT 

TTCGGTCGAGTGAATTCAGACCCGGGACGACCGTAACCGATCAGATCGATGG 



4941 AAAAGTTTAGAGAAGAAAATGACTGAGTCAAGCTTGCCTAATGAC^^ 
TrTIX^AATCTCTTCTTTTACTGACTCAGTTCGAACGGA 



5006 GTCCTAGAAAGCCTTAAAATAAGTAGGATATAAAACATGTAAATTAACCCACACATTATGTGGGT 
CAGGATCTTTCGGAAriTTATTCATCCTATATTITGTACATTTAATTGG^ 



5071 TGAGAAGCAGAAAAATGTCAGTAGAACACTCGGCCAGTGCAT^ 

ACTClTCGTCTTTTTACAGTCATCTrcTGAGCCGGTCACGTATTTCTrc 



5136 TGGGTTATAAAACTGCT CmGl XXTCAATTTGTC^ 

ACCC^TATTITGACGAGAAACACGAGTTAAACAGGGGACGAAAACAAACGGT^ 



5201 TTATAAAATAAACTCACTTITACTTrrAAAAAAAAAAAAAAAAAAAGGGCGG 
AATATTTTATTTGAGTGAAAATGAAAAl'l'riTl'ri'ri'riTiTlTi'lCCCGCC 
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CACCCGTCCGCCGGATCAGAGAGTGCTCCGAGCTGGGTTGCCCCACTGTGCTTGTATCTGCACTCTCCAACACTAGGC 7 9 

ATCATTGACATGTTAAAGCTTAGCCAAATAGAATTGTTC^^ 158 

MATESTPSE 9 
GATTTCATAATATATTTCCTGGTTTAGAGGAAACAGGAACA ATG GCT ACC GAG AGT ACT CCC TCA GAG 226 

II.ERERKKLLEILQHDPDSI29 
ATC ATA GAA AGA GAA AGA AAA AAG TTG CTT GAA ATC CTT CAA CAT GAT CCT GAT TCT ATC 28« 

LDTLTSRRLISEEEYETLEN49 
TTA GAC ACG TTA ACT TCT CGG AGG CTG ATT TCT GAG GAA GAG TAT GAG ACT CTG GAG AAT 346 

VTDLLKKSRKLLILVQKKGE69 
CTT ACA GAT CTC CTG AAG AAA AGT CGG AAG CTG TTA ATT TTG GTA CAG AAA AAG GGA GAG 406 

ATCQH FLKCLFSTFPQLAAI 89 
GCG ACC TGT CAG CAT TTT CTC AAG TGT TTA TTT AGT ACT TTT CCA CAG TTA GCT GCC ATT 466 

CGLRHEVLKHENTVPPQSMG 109 
TGC GGC TTA AGG CAT GAA GTT TTA AAA CAT GAG AAT ACA GTA CCT CCT CAA TCT ATG GGG S26 

ASSNSEDAFSPGIKQPEAPE 129 
GCA AGC AGT AAT TCA GAA GAT GCT TTT TCT CCT GGA ATA AAA CAG CCT GAA GCC CCT GAG SBfi 

ITVFFSEKEHLDLETSEFFR 149 
ATC ACA GTG TTC TTC AGT GAG AAG GAA CAC TTG GAT TTG GAA ACC TCT GAG TTT TTC AGG 646 

DKKTSYRETA LSARKNEKEY 169 
GAC AAG AAA ACT AGT TAT AGG GAA ACA GCT TTG TCT GCC AGG AAG AAT GAG AAG GAA TAT 706 

DT PEVTLSY SVEKVGCEVPA 189 
GAC ACA CCA GAA GTC ACA TTA TCA TAT TCA GTT GAG AAA GTT GGA TGT GAA GTT CCA GCA 766 

TI TYIKDGQRYEELDDSLYL 209 
ACT ATT ACA TAT ATA AAA GAT GGA CAG AGA TAT GAG GAG CTA GAT GAT TCT TTA TAC TTA 826 

GKEEYLGSVDTPEDAEATVE 229 
GGA AAA GAG GAA TAT CTA GGA TCT GTT GAC ACC CCT GAA GAT GCA GAA GCC ACT GTG GAA 886 

EEVYDDPEHVGYDGEEDFEN 249 
GAG GAG GTT TAT GAT GAC CCA GAG CAC GTT GGA TAT GAT GGT GAA GAG GAC TTC GAG AAT 946 

SETTEFSGEEPSYEGSETSL 269 
TCA GAA ACC ACA GAG TTC TCT GGT GAA GAA CCA AGT TAT GAG GGA TCA GAA ACC AGC CTT 1006 

SLBEEQEKSIEERKKVFKDV 289 
TCA TTG GAG GAG GAA CAG GAG AAA AGT ATA GAA GAA AGA AAA AAG GTG TTT AAA GAT GTC 1066 

LLCLNMDRSRKVLPDFVKQF 309 
CTG TTA TGT TTG AAC ATG GAT AGA AGC AGA AAG GTT CTG CCA GAT TTT GTT AAA CAA TTC 1126 

SL'DRGCKWTPES PGDLAWNF 329 
TCC TTA GAT CGA GGA TGT AAG TGG ACC CCT GAG AGT CCA GGA GAC TTA GCC TGG AAT TTC 1186 

LMKVQARDVTARDS ILSHKV 349 
CTG ATG AAA GTT CAA GCA CGA GAT GTG ACG GCT AGG GAT TCA ATC CTC AGT CAC AAG GTT 1246 

L D E D S KEDLLAGVENLEI RD 369 
CTG GAT GAA GAT AGC AAG GAG GAT TTG CTG GCT GGA GTG GAG AAT TTG GAA ATT CGA GAC 1306 
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IQTINPLDVLCATMLCSDSS 3S9 
ATA CAA ACC ATT AAT CCC CTT GAC GTG CTT TGT GCC ACC ATG CTG TGT TCA GAT ACC TCT 13S6 

LQRQVMSNMYQCQFALPLLL 409 
TTG CAA CGC CAA GTC ATG TCA AAC ATG TAT CAG TGC CAG TTT GCT CTT CCC CTG CTA CTG 1426 

PDAENNKSI LMLGAMKDIVK 429 
CCA GAT GCA GAA AAC AAC AAA AGC ATC TTA ATG CTG GGG GCC ATG AAA GAC ATT GTG AAG 1486 

KQSTQFSGGPTEDTEKFLTL 449 
AAG CAG TCA ACA CAG TTT TCA GGG GGG CCT ACA GAG GAT ACA GAA AAG TTT CTG ACT CTC 1S46 

MKMPVISFVRLGYCSFSKSR 469 
ATG AAG ATG CCT GTC ATC TCT TTT GTG CGT CTA GGA TAC TGT AGC TTC TCT AAG TCC AGA 1606 

ILNTLLSPAQLKLHKIFLHQ 489 
ATC CTC AAC ACA CTT CTC AGC CCT GCC CAG TTG AAA TTA CAC AAA ATC TTT CTT CAT CAA 1666 

DLPLL.V L PRQ I SD GLV E I TW 509 
GAT TTG CCT CTT TTG GTG CTT CCC CGG CAA ATC TCT GAT GGC CTG GTT GAG ATA ACA TGG 1726 

CFPDSDDRKENPFFOKPVAL 529 
TGT TTT CCT GAT AGC GAT GAT AGA AAG GAA AAC CCC TTT TTC CAA AAG CCT GTT GCT CTG 1786 

ANLRGNLESFWTQFGFLMEV S49 
GCT AAT CTC CGT GGA AAT CTA GAA AGT TTT TGG ACT CAG TTT GGT TTT TTG ATG GAA GTT 1846 

S SAV FF FTDC.LGE KEWD LLM 569 
TCT TCA GCT GTG TTT TTT TTC ACT GAC TGT TTA GGT GAG AAG GAA TGG GAC TTG CTA ATG 1906 

FLGEAAI ERCYFVLSSQARE 589 
TTT TTA GGA GAG GCT GCC ATT GAA AGA TGC TAC TTT GTT CTC AGT TCC CAA GCC AGG GAG 1966 

SEEAQI FQRILNLKPAQ LLF 609 
AGT GAA GAG GCT CAA ATT TTT CAG AGG ATA CTG AAC TTG AAG CCA GCA CAG CTA CTG TTT 2026 

WERGDAGDRRKNMEGLQAAL 629 
TGG GAG AGG GGA GAT GCT GGG GAT AGA AGG AAG AAC ATG GAG GGC CTT CAA GCT GCC CTC 2086 

QEVMFSSCLRCVSVEDMAAL 649 
CAG GAA GTG ATG TTC TCT TCT TGC CTC AGA TGT GTG TCT GTG GAG GAT ATG GCC GCC CTG 2146 

ARE LGI QVDED FENTQR I QV 669 
GCC AGG GAG CTG GGG ATT CAG GTA GAT GAA GAC TTT GAA AAC ACT CAG AGA ATT CAA GTT 2206 

SSGENMAGTAEGEGQQRHSQ 689 
TCC TCT GGA GAA AAC ATG GCT GGG ACA GCT GAA GGT GAG GGT CAG CAA AGA CAC AGT CAG 2266 

LKS SSKSQALMPIQEPGTQC 709 
CTA AAA AGC TCA TCT AAA AGC CAG GCT CTA ATG CCA ATT CAA GAG CCT GGG ACT CAA TGT 2326 

ELSQNLQNLYGTPVFRPVLE 729 
GAG CTC AGC CAG AAT CTT CAG AAT CTC TAT GGT ACC CCA GTA TTC AGG CCT GTT CTA GAG 2386 

NSWLFPTRIGGNFNHVS h K A 749 
AAC TCC TGG CTC TTT CCA ACC AGA ATT GGA GGT AAC TTT AAC CAT GTT TCC TTG AAA GCC 2446 

SWVMGRPFGSEQRPKWFKPL. 769 
TCC TGG GTT ATG GGC CGC CCC TTT GGG TCA GAG CAG AGG CCT AAG TGG TTC CAT CCT TTG 2S06 
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PFQNAGAQGRGKSFGIQSFH 789 
CCT TTT CAG AAT GCA GGG GCC CAG GGC CGA GGT AAA AGT TTT GGT ATT CAA TCC TTC CAT 2566 

PQI FYSGERFMKPS R V A R G C 809 
CCC CAG ATA TTT TAT TCA GGT GAA AGA TTC ATG AAA TTT TCC AGA GTT GCT CGG GGA TGT 2626 

HSNGT FGRLPRPICQHVQAC 829 
CAC TCG AAT GGA ACA TTT GGG AGA CTG CCA AGA CCC ATT TGT CAG CAT GTA CAG GCC TGC 2686 

PERPQHMGTLERSRAVASKI 849 
CCT GAG AGA CCA CAA ATG ATG GGA ACT CTT GAA AGG TCT AGG GCA GTA GCC TCC AAG ATA 2746 

GHSYS LDSQPARAVGKPWPQ 869 
GGT CAC TCC TAT TCC CTG GAT TCA CAG CCA GCA AGA GCA GTA GGG AAG CCA TGG CCT CAG 2806 

QACTRVTELTEATGKLIRTS 889 
CAA GCT TGC ACC AGG GTA ACA GAG TTA ACT GAA GCA ACT GGA AAA CTG ATA AGA ACA TCC 2866 

HIGKPHPQSFQPAAATQKLR 909 
CAT ATT GGA AAG CCT CAC CCT CAG TCC TTT CAA CCA GCA GCA GCC ACA CAA AAA CTA AGA 2926 

PASQQGVQMKTQGGASNPAL929 
CCT GCT TCT CAG CAA GGA GTC CAG ATG AAG ACA CAA GGT GGG GCT TCA AAT CCA GCT CTC 2986 

QIGSH PMCKSSQFKSDQSNP949 
CAA ATA GGG TCC CAT CCC ATG TGC AAG AGC TCT CAG TTC AAA TCC GAT CAG TCC AAC CCA 3046 

STV KH S Q P KP .FHSV PS Q P KS 969 
TCC ACA GTC AAA CAC TCC CAG CCT AAA CCC TTC CAT TCT GTG CCC TCT CAA CCT AAA TCC 3106 

SQTKSCQSQPSQTKPSPCKS 989 
TCT CAG ACA AAA TCC TGT CAG TCC CAG CCC TCC CAA ACT AAA CCT TCT CCA TGC AAA TCT 3166 

TQPKPSQPWPPQSKPSQPRP 1009 
ACT CAG CCT AAG CCA AGC CAG CCC TGG CCT CCC CAG TCT AAG CCT TCT CAG CCC AGA CCC 3226 

PQPK S SSTNPSQAKAHHS K A 1029 
CCT CAA CCT AAG TCA TCC TCA ACC AAT CCT TCA CAA GCT AAG GCA CAC CAC TCA AAA GCA 3286 

GQKRGGKH* 1038 
GGG CAG AAG AGG GGA GGG AAG CAT TAA 3313 

AGAGCTAACTCCAGAGATCTATAAAGCATATCCTTTACCCAGGCCATTCCTATCATATAGTAAGCAGAAGAGTTGCCAT 3392 

GAAAGTAAAAGACTACTGTCATTAGCATGTAAAACAAAGAAAGATATACATGACCGAATTGGATAT ^ 3471 

TTTGAGACAGAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAATGG 3550 

GGCTTAAAGTGATTCTCCTGCCTCAGCCTCTCGAGTAGCTGGGATTACAGOCATGCACCACCACACCCAG CTAATTTTG 3629 

TATTTTTAGTAGAGGCAGGGTTTCTCCATGTTGGTCAGGCTGGTCnTGAACTCC 3708 

GGCCTCTCAAAGTGTTC^SGATTACGTGTGTAAGCCACAGTGCCCAGCCCGAATTGGATATCTTT^ 3787 

GTTATATCCCTAACaVAGAAGAAAAATATGAAAATAATTAAGACTAGAATCA^ 3866 

GGTATTATTAGATAATGTATAACTTGCACCCAGGGAATGX3GGGTCTATGAGACAACCCCACTTGGAGAAGAATGGG 394S 

AGGGTCTCTAATTGCAAAGTGACTGTACAATAGGACGAAAGTTGCCTCTGTGTCTGAGAAAGTATC^ 4024 
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GCrCCAGAGGTAT C * ri * fG TCRARAGClTCTGGTTCAATATCAGCCRCTGRGCftGAT^CCC TO CIT A TTTG G TGTGGl"r 4103 
AAATCAACTAGC'raCl\3CTAATAGCCCCAA ViTGCrT GAATGGGAAAACTCTCT 4182 
ATGAATTAAC^CCAATAAAATTAATCATTTGGCATTAAAAAAAAAAAAAAAAAAAAARAAA 4244 
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CARD6 5 STP — SEIIERERKKLLEILQHD - PDS ILDTLTSRRLISEEEYETLE 48 

CONSENSUS ragakledDKarelvdslqrrgsqaf daf idaledTgqsyLAdvLel<- * 
+ 1 + r 1++ +q++g- +••■ f + +1++ LA++ +1 

CARD 6 49 NVTOLLKK — SRKLLILVQKKGEATCQHFLKCLFS-TFPQLAAICGL 92 
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Isolated nucleic acid molecules as defined by SEQID 1,3 and 
the corresponding isolated polypeptide sequence as defined 
by SEQID 2; the recombinant expression of the same in host 
cells; an antibody binding to said polypeptide and a method 
for producing said polypeptide, a method for detecting the 
presence of said polypeptide in a sample by contacting the 
sample with the antibody or by hybridization with a probe or 
primer; a method for identifying a compound that binds to 
said polypeptide or modulates the activity of said 
polypeptide and a method of modulating the activity of said 
polypetide by contacting the polypeptide with said compound. 



2. Claims: 1-22 partially 

Isolated nucleic acid molecules as defined by SEQID 
7,9,25,27,38,40 and the corresponding isolated polypeptide 
sequences as defined by SEQID 8,26,39,41; the recombinant 
expression of the same in host cells; an antibody binding to 
said polypeptide and a method for producing said 
polypeptide, a method for detecting the presence of said 
polypeptide in a sample by contacting the sample with the 
antibody or by hybridization with a probe or primer; a 
method for identifying a compound that binds to said 
polypeptide or modulates the activity of said polypeptide 
and a method of modulating the activity of said polypetide 
by contacting the polypeptide with said compound. 



3. Claims: 1-22 partially 

Isolated nucleic acid molecule as defined by SEQID 42 and 
the corresponding isolated polypeptide sequence as defined 
by SEQID 43; the recombinant expression of the same in host 
cells; an antibody binding to said polypeptide and a method 
for producing said polypeptide, a method for detecting the 
presence of said polypeptide in a sample by contacting the 
sample with the antibody or by hybridization with a probe or 
primer; a method for identifying a compound that binds to 
said polypeptide or modulates the activity of said 
polypeptide and a method of modulating the activity of said 
polypetide by contacting the polypeptide with said compound. 



4. Claims: 1-22 partially 

Isolated nucleic acid molecules as defined by SEQID 
48,50,60,62 and the corresponding isolated polypeptide 
sequences as defined by SEQID 49,61 ; the recombinant 
expression of the same in host cells; an antibody binding to 
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said polypeptide and a method for producing said 
polypeptide, a method for detecting the presence of said 
polypeptide in a sample by contacting the sample with the 
antibody or by hybridization with a probe or primer; a 
method for identifying a compound that binds to said 
polypeptide or modulates the activity of said polypeptide 
and a method of modulating the activity of said polypetide 
by contacting the polypeptide with said compound. 



5. Claims: 1-22 partially 

Isolated nucleic acid molecules as defined by SEQID 
54,56,51,53 and the corresponding isolated polypeptide 
sequences as defined by SEQID 55,52; the recombinant 
expression of the same in host cells; an antibody binding to 
said polypeptide and a method for producing said 
polypeptide, a method for detecting the presence of said 
polypeptide in a sample by contacting the sample with the 
antibody or by hybridization with a probe or primer; a 
method for identifying a compound that binds to said 
polypeptide or modulates the activity of said polypeptide 
and a method of modulating the activity of said polypetide 
by contacting the polypeptide with said compound. 
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